Experiment overview
Setting | Value |
---|---|
Model for non-random steps | BOTORCH_MODULAR |
Max. nr. evaluations | 500 |
Number random steps | 20 |
Nr. of workers (parameter) | 20 |
Main process memory (GB) | 8 |
Worker memory (GB) | 10 |
Job Summary per Generation Node
Generation Node | Total | COMPLETED | FAILED | RUNNING |
---|---|---|---|---|
SOBOL | 13 | 1 | 8 | 4 |
Experiment parameters
Name | Type | Lower bound | Upper bound | Values | Value type | Log Scale? |
---|---|---|---|---|---|---|
epochs | range | 10 | 200 | | int | No |
lr | range | 1e-05 | 0.1 | | float | No |
batch_size | range | 8 | 2048 | | int | No |
hidden_size | range | 8 | 2048 | | int | No |
dropout | range | 0 | 0.5 | | float | No |
activation | fixed | | | leaky_relu | | |
num_dense_layers | range | 1 | 4 | | int | No |
init | fixed | | | normal | | |
weight_decay | range | 0 | 1 | | float | No |
Number of evaluations
Failed | Succeeded | Running | Total |
---|---|---|---|
8 | 1 | 4 | 13 |
Result names and types
Last progressbar status
2025-07-31 17:12:28: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/unknown 3/1∑4 (20%/20), started new job
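The status line reports that 8 trials failed because no line of the form `VAL_ACC: <FLOAT>` was found in their output. As a minimal sketch, assuming the result is read from the program's standard output as a line of exactly that shape (the regular expression and the helper `extract_val_acc` are illustrative, not OmniOpt2's actual parser), such a check could look like this:

```python
import re
from typing import Optional

# Assumed shape of a result line: the result name, a colon, then a float.
RESULT_LINE = re.compile(
    r"^\s*VAL_ACC:\s*([-+]?(?:\d+\.?\d*|\.\d+)(?:[eE][-+]?\d+)?)\s*$",
    re.MULTILINE,
)


def extract_val_acc(stdout: str) -> Optional[float]:
    """Return the first VAL_ACC value found in stdout, or None if absent."""
    match = RESULT_LINE.search(stdout)
    return float(match.group(1)) if match else None


# Hypothetical program outputs: one that prints a result, one that crashes.
good_output = "epoch 80 done\nVAL_ACC: 3.67\n"
bad_output = "Traceback (most recent call last):\n  ...\n"

print(extract_val_acc(good_output))
print(extract_val_acc(bad_output))
```

Under this assumption, a training script that exits before printing the result line yields no value, which would match the failure reason shown in the status line.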
Git-Version
Commit: f9547a580b93e0983ebff52a5b7750569294ad57
trial_index,submit_time,queue_time,start_time,end_time,run_time,program_string,exit_code,signal,hostname,OO_Info_SLURM_JOB_ID,arm_name,trial_status,generation_node,VAL_ACC,epochs,lr,batch_size,hidden_size,dropout,num_dense_layers,weight_decay,activation,init
0,,,,,,,,,,,0_0,RUNNING,SOBOL,,118,0.065774644992351527506002639711,1691,1295,0.21284355223178863525390625,3,0.01254769600927829742431640625,leaky_relu,normal
1,1753974072,36,1753974108,1753974575,467,python3 .tests/mnist/train --epochs 80 --learning_rate 0.02334596429724246358 --batch_size 436 --hidden_size 1004 --dropout 0.25205665826797485352 --activation leaky_relu --num_dense_layers 1 --init normal --weight_decay 0.57216477766633033752,0,,c149,531228,1_0,COMPLETED,SOBOL,3.66999999999999992894572642399,80,0.023345964297242463580950300184,436,1004,0.252056658267974853515625,1,0.5721647776663303375244140625,leaky_relu,normal
2,1753974122,16,1753974138,1753974168,30,python3 .tests/mnist/train --epochs 46 --learning_rate 0.09713359326949343175 --batch_size 1117 --hidden_size 1727 --dropout 0.41099013993516564369 --activation leaky_relu --num_dense_layers 2 --init normal --weight_decay 0.25285878218710422516,1,,c137,531229,2_0,FAILED,SOBOL,,46,0.097133593269493431754391110644,1117,1727,0.410990139935165643692016601562,2,0.25285878218710422515869140625,leaky_relu,normal
3,1753974153,14,1753974167,1753974197,30,python3 .tests/mnist/train --epochs 199 --learning_rate 0.04191316491941922406 --batch_size 884 --hidden_size 101 --dropout 0.12421447457745671272 --activation leaky_relu --num_dense_layers 4 --init normal --weight_decay 0.82013589516282081604,1,,c137,531234,3_0,FAILED,SOBOL,,199,0.041913164919419224063723561358,884,101,0.124214474577456712722778320312,4,0.8201358951628208160400390625,leaky_relu,normal
4,1753974207,20,1753974227,1753974257,30,python3 .tests/mnist/train --epochs 168 --learning_rate 0.08036519073177129935 --batch_size 90 --hidden_size 303 --dropout 0.04812996461987495422 --activation leaky_relu --num_dense_layers 3 --init normal --weight_decay 0.47574356291443109512,1,,c137,531235,4_0,FAILED,SOBOL,,168,0.080365190731771299348373815974,90,303,0.0481299646198749542236328125,3,0.475743562914431095123291015625,leaky_relu,normal
5,1753974278,31,1753974309,1753974339,30,python3 .tests/mnist/train --epochs 31 --learning_rate 0.03686539158684202372 --batch_size 1893 --hidden_size 1997 --dropout 0.47912774048745632172 --activation leaky_relu --num_dense_layers 1 --init normal --weight_decay 0.90828537847846746445,1,,c137,531239,5_0,FAILED,SOBOL,,31,0.036865391586842023718961769418,1893,1997,0.47912774048745632171630859375,1,0.908285378478467464447021484375,leaky_relu,normal
6,1753974361,17,1753974378,1753974408,30,python3 .tests/mnist/train --epochs 96 --learning_rate 0.06144834716449492501 --batch_size 665 --hidden_size 675 --dropout 0.32459180103614926338 --activation leaky_relu --num_dense_layers 2 --init normal --weight_decay 0.22765144798904657364,1,,c137,531240,6_0,FAILED,SOBOL,,96,0.061448347164494925010114201314,665,675,0.324591801036149263381958007812,2,0.227651447989046573638916015625,leaky_relu,normal
7,1753974409,29,1753974438,1753974468,30,python3 .tests/mnist/train --epochs 150 --learning_rate 0.00574279837509617235 --batch_size 1448 --hidden_size 1154 --dropout 0.14798459643498063087 --activation leaky_relu --num_dense_layers 4 --init normal --weight_decay 0.66809719707816839218,1,,c137,531243,7_0,FAILED,SOBOL,,150,0.00574279837509617235163927873,1448,1154,0.147984596434980630874633789062,4,0.668097197078168392181396484375,leaky_relu,normal
8,1753974462,41,1753974503,1753974539,36,python3 .tests/mnist/train --epochs 135 --learning_rate 0.09142146977110766903 --batch_size 645 --hidden_size 1914 --dropout 0.29632516112178564072 --activation leaky_relu --num_dense_layers 4 --init normal --weight_decay 0.38379055727273225784,1,,c137,531245,8_0,FAILED,SOBOL,,135,0.091421469771107669033405329628,645,1914,0.296325161121785640716552734375,4,0.383790557272732257843017578125,leaky_relu,normal
9,1753974533,24,1753974557,1753974600,43,python3 .tests/mnist/train --epochs 87 --learning_rate 0.04762528841780499372 --batch_size 1356 --hidden_size 417 --dropout 0.23879745323210954666 --activation leaky_relu --num_dense_layers 2 --init normal --weight_decay 0.94945039879530668259,1,,c137,531248,9_0,FAILED,SOBOL,,87,0.047625288417804993723603246281,1356,417,0.238797453232109546661376953125,2,0.949450398795306682586669921875,leaky_relu,normal
10,,,,,,,,,,,10_0,RUNNING,SOBOL,,16,0.073049490599259733758508161827,197,1234,0.080348681192845106124877929688,1,0.132035260088741779327392578125,leaky_relu,normal
11,,,,,,,,,,,11_0,RUNNING,SOBOL,,158,0.016071115989768878368204596541,1930,563,0.384454274084419012069702148438,3,0.689485992304980754852294921875,leaky_relu,normal
12,,,,,,,,,,,12_0,RUNNING,SOBOL,,184,0.056224445844460284316124187853,1208,864,0.445146777667105197906494140625,4,0.09662049822509288787841796875,leaky_relu,normal
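The results table above can be summarized with a few lines of Python. The sketch below uses a reduced copy of the table (only `trial_index`, `run_time`, `exit_code`, `hostname` and `trial_status`, values taken from the rows above) and counts trials per status and failed trials per host. The variable names are illustrative.

```python
import csv
import io
from collections import Counter

# Reduced copy of the results table: only the columns needed for the tally.
RESULTS_CSV = """trial_index,run_time,exit_code,hostname,trial_status
0,,,,RUNNING
1,467,0,c149,COMPLETED
2,30,1,c137,FAILED
3,30,1,c137,FAILED
4,30,1,c137,FAILED
5,30,1,c137,FAILED
6,30,1,c137,FAILED
7,30,1,c137,FAILED
8,36,1,c137,FAILED
9,43,1,c137,FAILED
10,,,,RUNNING
11,,,,RUNNING
12,,,,RUNNING
"""

rows = list(csv.DictReader(io.StringIO(RESULTS_CSV)))

# Number of trials per status.
status_counts = Counter(row["trial_status"] for row in rows)

# Hosts on which the failed trials ran.
failed_hosts = Counter(
    row["hostname"] for row in rows if row["trial_status"] == "FAILED"
)

print(dict(status_counts))
print(dict(failed_hosts))
```

In this run every failed trial ran on host c137 and exited with code 1, while the only completed trial ran on c149. That may be worth checking before looking at the hyperparameters themselves.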
To cancel, press CTRL c, then run 'scancel 531216'
Importing logging...
Importing warnings...
Importing argparse...
Importing datetime...
Importing dataclass...
Importing hashlib...
Importing socket...
Importing stat...
Importing pwd...
Importing signal...
Importing base64...
Importing json...
Importing yaml...
Importing toml...
Importing csv...
Importing ast...
Importing rich.table...
Importing rich print...
Importing rich.pretty...
Importing rich.prompt...
Importing types.FunctionType...
Importing typing...
Importing ThreadPoolExecutor...
Importing submitit.LocalExecutor...
Importing submitit.Job...
Importing importlib.util...
Importing inspect...
Importing platform...
Importing inspect frame info...
Importing pathlib.Path...
Importing uuid...
Importing traceback...
Importing cowsay...
Importing psutil...
Importing shutil...
Importing itertools.combinations...
Importing os.listdir...
Importing os.path...
Importing PIL.Image...
Importing sixel...
Importing subprocess...
Importing tqdm...
Importing beartype...
Importing statistics...
Trying to import pyfiglet...
Importing helpers...
Parsing arguments...
Importing torch...
Importing numpy...
Importing collections...
Importing ax...
Importing ax.core.generator_run...
Importing Cont_X_trans and Y_trans from ax.modelbridge.registry...
Importing ax.core.arm...
Importing ax.core.objective...
Importing ax.core.Metric...
Importing ax.exceptions.core...
Importing ax.exceptions.generation_strategy...
Importing CORE_DECODER_REGISTRY...
Trying ax.generation_strategy.generation_node...
Importing GenerationStep, GenerationStrategy from generation_strategy...
Importing GenerationNode from generation_node...
Importing ExternalGenerationNode...
Importing MaxTrials...
Importing GeneratorSpec...
Importing Models from ax.modelbridge.registry...
Importing get_pending_observation_features...
Importing load_experiment...
Importing save_experiment...
Importing save_experiment_to_db...
Importing TrialStatus...
Importing Data...
Importing Experiment...
Importing parameter types...
Importing TParameterization...
Importing pandas...
Importing AxClient and ObjectiveProperties...
Importing RandomForestRegressor...
Importing botorch...
Importing submitit...
Importing ax logger...
Importing SQL-Storage-Stuff...
Run-UUID: 44a8e3d1-d1c7-4334-9e97-e0bed003dead
OmniOpt2 - Adjusting the dials, one click at a time.
Writing worker creation log...
omniopt --partition=alpha --experiment_name=mnist_gpu_noall --mem_gb=10 --time=2880 --worker_timeout=120 --max_eval=500 --num_parallel_jobs=20 --gpus=1 --num_random_steps=20 --follow --live_share --send_anonymized_usage_stats --result_names VAL_ACC=max --run_program=cHl0aG9uMyAudGVzdHMvbW5pc3QvdHJhaW4gLS1lcG9jaHMgJWVwb2NocyAtLWxlYXJuaW5nX3JhdGUgJWxyIC0tYmF0Y2hfc2l6ZSAlYmF0Y2hfc2l6ZSAtLWhpZGRlbl9zaXplICVoaWRkZW5fc2l6ZSAtLWRyb3BvdXQgJWRyb3BvdXQgLS1hY3RpdmF0aW9uICVhY3RpdmF0aW9uIC0tbnVtX2RlbnNlX2xheWVycyAlbnVtX2RlbnNlX2xheWVycyAtLWluaXQgJWluaXQgLS13ZWlnaHRfZGVjYXkgJXdlaWdodF9kZWNheQ== --cpus_per_task=1 --nodes_per_job=1 --revert_to_random_when_seemingly_exhausted --model=BOTORCH_MODULAR --n_estimators_randomforest=100 --run_mode=local --occ_type=euclid --main_process_gb=8 --max_nr_of_zero_results=50 --slurm_signal_delay_s=0 --max_failed_jobs=0 --max_attempts_for_generation=20 --num_restarts=20 --raw_samples=1024 --max_abandoned_retrial=20 --max_num_of_parallel_sruns=16 --parameter epochs range 10 200 int false --parameter lr range 0.00001 0.1 float false --parameter batch_size range 8 2048 int false --parameter hidden_size range 8 2048 int false --parameter dropout range 0 0.5 float false --parameter activation fixed leaky_relu --parameter num_dense_layers range 1 4 int false --parameter init fixed normal --parameter weight_decay range 0 1 float false --ui_url 
aHR0cHM6Ly9pbWFnZXNlZy5zY2Fkcy5kZS9vbW5pYXgvZ3VpP3BhcnRpdGlvbj1hbHBoYSZleHBlcmltZW50X25hbWU9bW5pc3RfZ3B1X25vYWxsJnJlc2VydmF0aW9uPSZhY2NvdW50PSZtZW1fZ2I9MTAmdGltZT0yODgwJndvcmtlcl90aW1lb3V0PTEyMCZtYXhfZXZhbD01MDAmbnVtX3BhcmFsbGVsX2pvYnM9MjAmZ3B1cz0xJm51bV9yYW5kb21fc3RlcHM9MjAmZm9sbG93PTEmbGl2ZV9zaGFyZT0xJnNlbmRfYW5vbnltaXplZF91c2FnZV9zdGF0cz0xJmNvbnN0cmFpbnRzPSZyZXN1bHRfbmFtZXM9VkFMX0FDQyUzRG1heCZydW5fcHJvZ3JhbT1weXRob24zJTIwLnRlc3RzJTJGbW5pc3QlMkZ0cmFpbiUyMC0tZXBvY2hzJTIwJTI1ZXBvY2hzJTIwLS1sZWFybmluZ19yYXRlJTIwJTI1bHIlMjAtLWJhdGNoX3NpemUlMjAlMjViYXRjaF9zaXplJTIwLS1oaWRkZW5fc2l6ZSUyMCUyNWhpZGRlbl9zaXplJTIwLS1kcm9wb3V0JTIwJTI1ZHJvcG91dCUyMC0tYWN0aXZhdGlvbiUyMCUyNWFjdGl2YXRpb24lMjAtLW51bV9kZW5zZV9sYXllcnMlMjAlMjVudW1fZGVuc2VfbGF5ZXJzJTIwLS1pbml0JTIwJTI1aW5pdCUyMC0td2VpZ2h0X2RlY2F5JTIwJTI1d2VpZ2h0X2RlY2F5JmNwdXNfcGVyX3Rhc2s9MSZub2Rlc19wZXJfam9iPTEmc2VlZD0mZHJ5cnVuPTAmZGVidWc9MCZyZXZlcnRfdG9fcmFuZG9tX3doZW5fc2VlbWluZ2x5X2V4aGF1c3RlZD0xJmdyaWRzZWFyY2g9MCZtb2RlbD1CT1RPUkNIX01PRFVMQVImZXh0ZXJuYWxfZ2VuZXJhdG9yPSZuX2VzdGltYXRvcnNfcmFuZG9tZm9yZXN0PTEwMCZpbnN0YWxsYXRpb25fbWV0aG9kPWNsb25lJnJ1bl9tb2RlPWxvY2FsJmRpc2FibGVfdHFkbT0wJnZlcmJvc2VfdHFkbT0wJmZvcmNlX2xvY2FsX2V4ZWN1dGlvbj0wJmF1dG9fZXhjbHVkZV9kZWZlY3RpdmVfaG9zdHM9MCZzaG93X3NpeGVsX2dlbmVyYWw9MCZzaG93X3NpeGVsX3RyaWFsX2luZGV4X3Jlc3VsdD0wJnNob3dfc2l4ZWxfc2NhdHRlcj0wJnNob3dfd29ya2VyX3BlcmNlbnRhZ2VfdGFibGVfYXRfZW5kPTAmb2NjPTAmb2NjX3R5cGU9ZXVjbGlkJm5vX3NsZWVwPTAmc2x1cm1fdXNlX3NydW49MCZ2ZXJib3NlX2JyZWFrX3J1bl9zZWFyY2hfdGFibGU9MCZhYmJyZXZpYXRlX2pvYl9uYW1lcz0wJm1haW5fcHJvY2Vzc19nYj04Jm1heF9ucl9vZl96ZXJvX3Jlc3VsdHM9NTAmc2x1cm1fc2lnbmFsX2RlbGF5X3M9MCZtYXhfZmFpbGVkX2pvYnM9MCZleGNsdWRlPSZ1c2VybmFtZT0mZ2VuZXJhdGlvbl9zdHJhdGVneT0mcm9vdF92ZW52X2Rpcj0md29ya2Rpcj0mZG9udF9qaXRfY29tcGlsZT0wJmZpdF9vdXRfb2ZfZGVzaWduPTAmcmVmaXRfb25fY3Y9MCZzaG93X2dlbmVyYXRlX3RpbWVfdGFibGU9MCZkb250X3dhcm1fc3RhcnRfcmVmaXR0aW5nPTAmbWF4X2F0dGVtcHRzX2Zvcl9nZW5lcmF0aW9uPTIwJm51bV9yZXN0YXJ0cz0yMCZyYXdfc2FtcGxlcz0xMDI0Jm1heF9hYmFuZG9uZWRfcmV0cmlhbD0yMCZtYXhfbnVtX29mX3BhcmFs
bGVsX3NydW5zPTE2JmZvcmNlX2Nob2ljZV9mb3JfcmFuZ2VzPTAmbm9fdHJhbnNmb3JtX2lucHV0cz0wJmZpdF9hYmFuZG9uZWQ9MCZub19ub3JtYWxpemVfeT0wJnZlcmJvc2U9MCZnZW5lcmF0ZV9hbGxfam9ic19hdF9vbmNlPTAmZmxhbWVfZ3JhcGg9MCZjaGVja291dF90b19sYXRlc3RfdGVzdGVkX3ZlcnNpb249MCZwYXJhbWV0ZXJfMF9uYW1lPWVwb2NocyZwYXJhbWV0ZXJfMF90eXBlPXJhbmdlJnBhcmFtZXRlcl8wX21pbj0xMCZwYXJhbWV0ZXJfMF9tYXg9MjAwJnBhcmFtZXRlcl8wX251bWJlcl90eXBlPWludCZwYXJhbWV0ZXJfMF9sb2dfc2NhbGU9ZmFsc2UmcGFyYW1ldGVyXzFfbmFtZT1sciZwYXJhbWV0ZXJfMV90eXBlPXJhbmdlJnBhcmFtZXRlcl8xX21pbj0wLjAwMDAxJnBhcmFtZXRlcl8xX21heD0wLjEmcGFyYW1ldGVyXzFfbnVtYmVyX3R5cGU9ZmxvYXQmcGFyYW1ldGVyXzFfbG9nX3NjYWxlPWZhbHNlJnBhcmFtZXRlcl8yX25hbWU9YmF0Y2hfc2l6ZSZwYXJhbWV0ZXJfMl90eXBlPXJhbmdlJnBhcmFtZXRlcl8yX21pbj04JnBhcmFtZXRlcl8yX21heD0yMDQ4JnBhcmFtZXRlcl8yX251bWJlcl90eXBlPWludCZwYXJhbWV0ZXJfMl9sb2dfc2NhbGU9ZmFsc2UmcGFyYW1ldGVyXzNfbmFtZT1oaWRkZW5fc2l6ZSZwYXJhbWV0ZXJfM190eXBlPXJhbmdlJnBhcmFtZXRlcl8zX21pbj04JnBhcmFtZXRlcl8zX21heD0yMDQ4JnBhcmFtZXRlcl8zX251bWJlcl90eXBlPWludCZwYXJhbWV0ZXJfM19sb2dfc2NhbGU9ZmFsc2UmcGFyYW1ldGVyXzRfbmFtZT1kcm9wb3V0JnBhcmFtZXRlcl80X3R5cGU9cmFuZ2UmcGFyYW1ldGVyXzRfbWluPTAmcGFyYW1ldGVyXzRfbWF4PTAuNSZwYXJhbWV0ZXJfNF9udW1iZXJfdHlwZT1mbG9hdCZwYXJhbWV0ZXJfNF9sb2dfc2NhbGU9ZmFsc2UmcGFyYW1ldGVyXzVfbmFtZT1hY3RpdmF0aW9uJnBhcmFtZXRlcl81X3R5cGU9Zml4ZWQmcGFyYW1ldGVyXzVfdmFsdWU9bGVha3lfcmVsdSZwYXJhbWV0ZXJfNl9uYW1lPW51bV9kZW5zZV9sYXllcnMmcGFyYW1ldGVyXzZfdHlwZT1yYW5nZSZwYXJhbWV0ZXJfNl9taW49MSZwYXJhbWV0ZXJfNl9tYXg9NCZwYXJhbWV0ZXJfNl9udW1iZXJfdHlwZT1pbnQmcGFyYW1ldGVyXzZfbG9nX3NjYWxlPWZhbHNlJnBhcmFtZXRlcl83X25hbWU9aW5pdCZwYXJhbWV0ZXJfN190eXBlPWZpeGVkJnBhcmFtZXRlcl83X3ZhbHVlPW5vcm1hbCZwYXJhbWV0ZXJfOF9uYW1lPXdlaWdodF9kZWNheSZwYXJhbWV0ZXJfOF90eXBlPXJhbmdlJnBhcmFtZXRlcl84X21pbj0wJnBhcmFtZXRlcl84X21heD0xJnBhcmFtZXRlcl84X251bWJlcl90eXBlPWZsb2F0JnBhcmFtZXRlcl84X2xvZ19zY2FsZT1mYWxzZSZwYXJ0aXRpb249YWxwaGEmbnVtX3BhcmFtZXRlcnM9OQ==
Disabling logging...
Setting run folder...
Creating folder /data/cat/ws/pwinkler-mnist_tst/omniopt/runs/mnist_gpu_noall/3...
Writing revert_to_random_when_seemingly_exhausted file ...
Writing username state file...
Writing result names file...
Writing result min/max file...
Saving state files...
Run-folder: /data/cat/ws/pwinkler-mnist_tst/omniopt/runs/mnist_gpu_noall/3
Printing run info...
Initializing NVIDIA-Logs...
Writing ui_url file if it is present...
Writing live_share file if it is present...
Writing job_start_time file...
Writing git info file...
Checking max_eval...
Calculating number of steps...
Adding excluded nodes...
Handling random steps...
Initializing ax_client...
[WARNING 07-31 16:59:53] ax.service.ax_client: Selecting a GenerationStrategy when using BatchTrials is in beta. Double check the recommended strategy matches your expectations.
Setting orchestrator...
You have 1 CPUs available for the main process. Using CUDA device NVIDIA H100. Generation strategy: SOBOL for 20 steps and then BOTORCH_MODULAR for 480 steps.
Run-Program: python3 .tests/mnist/train --epochs %epochs --learning_rate %lr --batch_size %batch_size --hidden_size %hidden_size --dropout %dropout --activation %activation --num_dense_layers %num_dense_layers --init %init --weight_decay %weight_decay
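The `--run_program` argument in the command line is the Base64 encoding of this Run-Program template, and each `%name` placeholder is replaced by the trial's parameter value. The sketch below reproduces both steps with the standard library. The helper `fill_template` is a hypothetical illustration of the substitution, not OmniOpt2's own code, and the values are those of trial 1 in the results table.

```python
import base64
import re

# Run-Program template as printed in the log.
template = (
    "python3 .tests/mnist/train --epochs %epochs --learning_rate %lr "
    "--batch_size %batch_size --hidden_size %hidden_size --dropout %dropout "
    "--activation %activation --num_dense_layers %num_dense_layers "
    "--init %init --weight_decay %weight_decay"
)

# Base64 round trip: encode the template, then decode it again.
encoded = base64.b64encode(template.encode("utf-8")).decode("ascii")
decoded = base64.b64decode(encoded).decode("utf-8")


def fill_template(text: str, values: dict) -> str:
    """Replace every %name placeholder with the matching value (hypothetical helper)."""
    return re.sub(r"%(\w+)", lambda m: str(values[m.group(1)]), text)


# Parameter values of trial 1, kept as strings to preserve the printed precision.
trial_1 = {
    "epochs": "80",
    "lr": "0.02334596429724246358",
    "batch_size": "436",
    "hidden_size": "1004",
    "dropout": "0.25205665826797485352",
    "activation": "leaky_relu",
    "num_dense_layers": "1",
    "init": "normal",
    "weight_decay": "0.57216477766633033752",
}

command = fill_template(decoded, trial_1)
print(encoded[:36])
print(command)
```

The first 36 characters of `encoded` match the start of the `--run_program` value in the command line, and `command` matches the `program_string` recorded for trial 1.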
Experiment parameters
Name | Type | Lower bound | Upper bound | Values | Value type | Log Scale? |
---|---|---|---|---|---|---|
epochs | range | 10 | 200 | | int | No |
lr | range | 1e-05 | 0.1 | | float | No |
batch_size | range | 8 | 2048 | | int | No |
hidden_size | range | 8 | 2048 | | int | No |
dropout | range | 0 | 0.5 | | float | No |
activation | fixed | | | leaky_relu | | |
num_dense_layers | range | 1 | 4 | | int | No |
init | fixed | | | normal | | |
weight_decay | range | 0 | 1 | | float | No |
Result-Names
Result-Name | Min or max? |
---|---|
VAL_ACC | max |
See https://imageseg.scads.de/omniax/share?user_id=pwinkler&experiment_name=mnist_gpu_noall&run_nr=10 for live-results.
Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 3∑3 (15%/20), starting new job : 0%|          | 1/500 [12:08<85:57:43, 620.17s/it]
Runtime (end): 13 minutes and 21 seconds, PID: 1708349
Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/unknown 3/1∑4 (20%/20), started new job : 0%|          | 1/500 [12:31<85:57:43, 620.17s/it]
2025-07-31 16:59:57: SOBOL, Started OmniOpt2 run...
2025-07-31 17:00:06: Sobol, getting new HP set
2025-07-31 17:00:13: Sobol, requested 1 jobs, got 1, 8.23 s/job
2025-07-31 17:00:17: Sobol, eval #1/1 start
2025-07-31 17:00:21: Sobol, starting new job
2025-07-31 17:00:43: Sobol, unknown 1∑1 (5%/20), started new job
2025-07-31 17:00:51: Sobol, pending 1∑1 (5%/20), getting new HP set
2025-07-31 17:00:59: Sobol, pending 1∑1 (5%/20), requested 1 jobs, got 1, 8.24 s/job
2025-07-31 17:01:03: Sobol, pending 1∑1 (5%/20), eval #1/1 start
2025-07-31 17:01:08: Sobol, pending 1∑1 (5%/20), starting new job
2025-07-31 17:01:13: Sobol, running/unknown 1/1∑2 (10%/20), started new job
2025-07-31 17:01:18: Sobol, running/pending 1/1∑2 (10%/20), getting new HP set
2025-07-31 17:01:48: Sobol, running 2∑2 (10%/20), requested 1 jobs, got 1, 30.01 s/job
2025-07-31 17:01:54: Sobol, running 2∑2 (10%/20), eval #1/1 start
2025-07-31 17:01:58: Sobol, running 2∑2 (10%/20), starting new job
2025-07-31 17:02:04: Sobol, running/unknown 2/1∑3 (15%/20), started new job
2025-07-31 17:02:09: Sobol, running/pending 2/1∑3 (15%/20), getting new HP set
2025-07-31 17:02:19: Sobol, running 3∑3 (15%/20), requested 1 jobs, got 1, 10.41 s/job
2025-07-31 17:02:24: Sobol, running 3∑3 (15%/20), eval #1/1 start
2025-07-31 17:02:29: Sobol, running 3∑3 (15%/20), starting new job
2025-07-31 17:02:34: Sobol, running/unknown 3/1∑4 (20%/20), started new job
2025-07-31 17:02:40: Sobol, running/pending 3/1∑4 (20%/20), getting new HP set
2025-07-31 17:02:49: Sobol, running/completed 3/1∑4 (15%/20), requested 1 jobs, got 1, 9.09 s/job
2025-07-31 17:02:54: Sobol, running/completed 3/1∑4 (15%/20), eval #1/1 start
2025-07-31 17:03:05: Sobol, running/completed 3/1∑4 (15%/20), starting new job
2025-07-31 17:03:29: Sobol, running/completed/unknown 2/2/1∑5 (15%/20), started new job
2025-07-31 17:03:40: Sobol, running/completed/pending 2/2/1∑5 (15%/20), job_failed
2025-07-31 17:03:40: Sobol, running/completed/pending 2/2/1∑5 (15%/20), job_failed
2025-07-31 17:04:02: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), finishing jobs (_get_next_trials), finished 2 jobs
2025-07-31 17:04:07: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), getting new HP set
2025-07-31 17:04:17: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), requested 1 jobs, got 1, 9.88 s/job
2025-07-31 17:04:23: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (10%/20), eval #1/1 start
2025-07-31 17:04:34: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (10%/20), starting new job
2025-07-31 17:04:40: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running/completed/unknown 2/1/1∑4 (15%/20), started new job
2025-07-31 17:05:06: Sobol, failed: 2 ('VAL_ACC: <FLOAT>' not found), running/completed 3/1∑4 (15%/20), job_failed
2025-07-31 17:05:18: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), finishing jobs (_get_next_trials), finished 1 job
2025-07-31 17:05:23: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), getting new HP set
2025-07-31 17:05:32: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), requested 1 jobs, got 1, 9.93 s/job
2025-07-31 17:05:46: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running/completed 2/1∑3 (10%/20), eval #1/1 start
2025-07-31 17:05:56: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running/completed 2/1∑3 (10%/20), starting new job
2025-07-31 17:06:02: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running/completed/unknown 2/1/1∑4 (15%/20), started new job
2025-07-31 17:06:08: Sobol, failed: 3 ('VAL_ACC: <FLOAT>' not found), running/completed/pending 2/1/1∑4 (15%/20), job_failed
2025-07-31 17:06:17: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), finishing jobs (_get_next_trials), finished 1 job
2025-07-31 17:06:23: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), getting new HP set
2025-07-31 17:06:33: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), requested 1 jobs, got 1, 10.27 s/job
2025-07-31 17:06:37: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), eval #1/1 start
2025-07-31 17:06:43: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), starting new job
2025-07-31 17:06:50: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running/unknown 3/1∑4 (15%/20), started new job
2025-07-31 17:07:00: Sobol, failed: 4 ('VAL_ACC: <FLOAT>' not found), running/completed/pending 2/1/1∑4 (15%/20), job_failed
2025-07-31 17:07:09: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running/pending 2/1∑3 (15%/20), finishing jobs (_get_next_trials), finished 1 job
2025-07-31 17:07:19: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), getting new HP set
2025-07-31 17:07:28: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), requested 1 jobs, got 1, 14.47 s/job
2025-07-31 17:07:33: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), eval #1/1 start
2025-07-31 17:07:37: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), starting new job
2025-07-31 17:07:43: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running/unknown 3/1∑4 (20%/20), started new job
2025-07-31 17:07:57: Sobol, failed: 5 ('VAL_ACC: <FLOAT>' not found), running/completed/pending 2/1/1∑4 (15%/20), job_failed
2025-07-31 17:08:07: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running/pending 2/1∑3 (15%/20), finishing jobs (_get_next_trials), finished 1 job
2025-07-31 17:08:14: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), getting new HP set
2025-07-31 17:08:31: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), requested 1 jobs, got 1, 17.05 s/job
2025-07-31 17:08:37: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), eval #1/1 start
2025-07-31 17:08:42: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running 3∑3 (15%/20), starting new job
2025-07-31 17:08:54: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running/unknown 3/1∑4 (20%/20), started new job
2025-07-31 17:09:00: Sobol, failed: 6 ('VAL_ACC: <FLOAT>' not found), running/pending 3/1∑4 (20%/20), job_failed
2025-07-31 17:09:11: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), running/pending 2/1∑3 (15%/20), finishing jobs (_get_next_trials), finished 1 job
2025-07-31 17:09:17: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), running/pending 2/1∑3 (15%/20), getting new HP set
2025-07-31 17:09:38: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), running/completed 2/1∑3 (10%/20), requested 1 jobs, got 1, 21.81 s/job
2025-07-31 17:09:44: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), running/completed 2/1∑3 (10%/20), eval #1/1 start
2025-07-31 17:09:54: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/completed 2/1∑3 (10%/20), starting new job
2025-07-31 17:10:00: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/completed/unknown 2/1/1∑4 (10%/20), started new job
2025-07-31 17:10:10: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/completed/pending 1/2/1∑4 (10%/20), job_failed
2025-07-31 17:10:10: Sobol, failed: 7 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/completed/pending 1/2/1∑4 (10%/20), new result: 3.67
2025-07-31 17:10:49: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 2∑2 (10%/20), finishing jobs (_get_next_trials), finished 2 jobs
2025-07-31 17:10:54: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 2∑2 (10%/20), getting new HP set
2025-07-31 17:11:04: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 2∑2 (10%/20), requested 1 jobs, got 1, 9.66 s/job
2025-07-31 17:11:09: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 2∑2 (10%/20), eval #1/1 start
2025-07-31 17:11:15: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 2∑2 (10%/20), starting new job
2025-07-31 17:11:20: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/unknown 2/1∑3 (15%/20), started new job
2025-07-31 17:11:27: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/pending 2/1∑3 (15%/20), getting new HP set
2025-07-31 17:11:48: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 3∑3 (15%/20), requested 1 jobs, got 1, 21.15 s/job
2025-07-31 17:11:54: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 3∑3 (15%/20), eval #1/1 start
2025-07-31 17:12:05: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running 3∑3 (15%/20), starting new job
2025-07-31 17:12:28: Sobol, failed: 8 ('VAL_ACC: <FLOAT>' not found), best VAL_ACC: 3.67, running/unknown 3/1∑4 (20%/20), started new job
Arguments Overview
Key | Value |
---|---|
config_yaml | None |
config_toml | None |
config_json | None |
num_random_steps | 20 |
max_eval | 500 |
run_program | [['cHl0aG9uMyAudGVzdHMvbW5pc3QvdHJhaW4gLS1lcG9jaHMgJWVwb2NocyAtLWxlYXJuaW5nX3JhdGUgJWxyIC0tYmF0Y2hfc2l6ZSAlYmF0Y2hfc2l6ZSAtLWhpZGRlbl9zaXplICVoaWRkZW5f… |
experiment_name | mnist_gpu_noall |
mem_gb | 10 |
parameter | [['epochs', 'range', '10', '200', 'int', 'false'], ['lr', 'range', '0.00001', '0.1', 'float', 'false'], ['batch_size', 'range', '8', '2048', 'int', |
| 'false'], ['hidden_size', 'range', '8', '2048', 'int', 'false'], ['dropout', 'range', '0', '0.5', 'float', 'false'], ['activation', 'fixed', |
| 'leaky_relu'], ['num_dense_layers', 'range', '1', '4', 'int', 'false'], ['init', 'fixed', 'normal'], ['weight_decay', 'range', '0', '1', 'float', |
| 'false']] |
continue_previous_job | None |
experiment_constraints | None |
run_dir | runs |
seed | None |
verbose_tqdm | False |
model | BOTORCH_MODULAR |
gridsearch | False |
occ | False |
show_sixel_scatter | False |
show_sixel_general | False |
show_sixel_trial_index_result | False |
follow | True |
send_anonymized_usage_stats | True |
ui_url | aHR0cHM6Ly9pbWFnZXNlZy5zY2Fkcy5kZS9vbW5pYXgvZ3VpP3BhcnRpdGlvbj1hbHBoYSZleHBlcmltZW50X25hbWU9bW5pc3RfZ3B1X25vYWxsJnJlc2VydmF0aW9uPSZhY2NvdW50PSZtZW1fZ2I… |
root_venv_dir | /home/pwinkler |
exclude | None |
main_process_gb | 8 |
max_nr_of_zero_results | 50 |
abbreviate_job_names | False |
orchestrator_file | None |
checkout_to_latest_tested_version | False |
live_share | True |
disable_tqdm | False |
disable_previous_job_constraint | False |
workdir | |
occ_type | euclid |
result_names | ['VAL_ACC=max'] |
minkowski_p | 2 |
signed_weighted_euclidean_weights | |
generation_strategy | None |
generate_all_jobs_at_once | False |
revert_to_random_when_seemingly_exhausted | True |
load_data_from_existing_jobs | [] |
n_estimators_randomforest | 100 |
max_attempts_for_generation | 20 |
external_generator | None |
username | None |
max_failed_jobs | 0 |
num_cpus_main_job | None |
calculate_pareto_front_of_job | [] |
show_generate_time_table | False |
force_choice_for_ranges | False |
max_abandoned_retrial | 20 |
share_password | None |
dryrun | False |
db_url | None |
run_program_once | None |
dont_warm_start_refitting | False |
refit_on_cv | False |
fit_out_of_design | False |
fit_abandoned | False |
dont_jit_compile | False |
num_restarts | 20 |
raw_samples | 1024 |
max_num_of_parallel_sruns | 16 |
no_transform_inputs | False |
no_normalize_y | False |
transforms | [] |
num_parallel_jobs | 20 |
worker_timeout | 120 |
slurm_use_srun | False |
time | 2880 |
partition | alpha |
reservation | None |
force_local_execution | False |
slurm_signal_delay_s | 0 |
nodes_per_job | 1 |
cpus_per_task | 1 |
account | None |
gpus | 1 |
run_mode | local |
verbose | False |
verbose_break_run_search_table | False |
debug | False |
flame_graph | False |
no_sleep | False |
tests | False |
show_worker_percentage_table_at_end | False |
auto_exclude_defective_hosts | False |
run_tests_that_fail_on_taurus | False |
raise_in_eval | False |
show_ram_every_n_seconds | 0 |
show_generation_and_submission_sixel | False |
just_return_defaults | False |
prettyprint | False |
1753973997.0729675,20,0,0
1753974000.8027582,20,0,0
1753974000.9815824,20,0,0
1753974006.1057937,20,0,0
1753974013.734341,20,0,0
1753974017.6994092,20,0,0
1753974021.710163,20,0,0
1753974043.8961456,20,1,5
1753974051.7096143,20,1,5
1753974059.2619443,20,1,5
1753974063.7025058,20,1,5
1753974068.0156422,20,1,5
1753974073.8424482,20,2,10
1753974078.9361887,20,2,10
1753974108.0578673,20,2,10
1753974114.1028826,20,2,10
1753974118.7199514,20,2,10
1753974124.083336,20,3,15
1753974129.2696266,20,3,15
1753974139.3340576,20,3,15
1753974144.7343316,20,3,15
1753974149.6994007,20,3,15
1753974154.9728312,20,4,20
1753974160.9145067,20,4,20
1753974169.735192,20,3,15
1753974174.726105,20,3,15
1753974185.291692,20,3,15
1753974209.1218338,20,3,15
1753974220.722943,20,3,15
1753974220.7264423,20,3,15
1753974242.024544,20,3,15
1753974247.9193194,20,3,15
1753974257.711289,20,3,15
1753974262.7087498,20,2,10
1753974273.049002,20,2,10
1753974280.1913927,20,3,15
1753974306.7262595,20,3,15
1753974318.1058397,20,3,15
1753974323.0024755,20,3,15
1753974332.7123349,20,3,15
1753974345.913593,20,2,10
1753974356.7130108,20,2,10
1753974362.8736148,20,3,15
1753974367.973043,20,3,15
1753974377.7464514,20,3,15
1753974383.1652508,20,3,15
1753974393.0685453,20,3,15
1753974397.7544036,20,3,15
1753974403.7603633,20,3,15
1753974410.3302603,20,3,15
1753974420.0282671,20,3,15
1753974429.1854503,20,3,15
1753974434.2614586,20,3,15
1753974448.708943,20,3,15
1753974453.1407683,20,3,15
1753974457.72604,20,3,15
1753974463.9013326,20,4,20
1753974477.9453053,20,3,15
1753974487.7409027,20,3,15
1753974494.1707904,20,3,15
1753974510.8748155,20,3,15
1753974517.0688016,20,3,15
1753974522.2158282,20,3,15
1753974534.879701,20,4,20
1753974540.1598833,20,4,20
1753974551.7076232,20,3,15
1753974557.10588,20,3,15
1753974578.7090914,20,2,10
1753974584.1052878,20,2,10
1753974594.3009734,20,2,10
1753974600.8457654,20,2,10
1753974610.73072,20,2,10
1753974610.7349787,20,2,10
1753974617.226526,20,2,10
timestamp,ram_usage_mb,cpu_usage_percent
1753973993,711.765625,7.4
1753973997,712.765625,12.8
1753974000,712.765625,5.9
1753974000,712.765625,7.7
1753974000,712.765625,6.5
1753974000,712.765625,7.4
1753974000,712.765625,8.3
1753974617,731.17578125,21.4
Parameter statistics
Parameter | Min | Max | Mean | Std Dev | Count |
---|---|---|---|---|---|
run_time | 30 | 467 | 80.6667 | 136.6545 | 9 |
VAL_ACC | 3.67 | 3.67 | 3.67 | 0 | 1 |
epochs | 16 | 199 | 112.9231 | 56.8675 | 13 |
lr | 0.0057 | 0.0971 | 0.0536 | 0.0273 | 13 |
batch_size | 90 | 1930 | 1043.0769 | 591.1404 | 13 |
hidden_size | 101 | 1997 | 1019.0769 | 586.2334 | 13 |
dropout | 0.0481 | 0.4791 | 0.265 | 0.1349 | 13 |
num_dense_layers | 1 | 4 | 2.6154 | 1.1461 | 13 |
weight_decay | 0.0125 | 0.9495 | 0.4761 | 0.3061 | 13 |
activation | No numerical statistics available |
init | No numerical statistics available |
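The `run_time` row can be reproduced from the nine trials with a recorded run time in the results table. The check below suggests that the Std Dev column is a population standard deviation (dividing by n) rather than a sample one (dividing by n - 1). This is an inference from the numbers, not something the page states.

```python
import statistics

# run_time values (seconds) of the nine trials with a recorded run time.
run_times = [467, 30, 30, 30, 30, 30, 30, 36, 43]

mean_rt = statistics.mean(run_times)
population_std = statistics.pstdev(run_times)  # divides by n
sample_std = statistics.stdev(run_times)       # divides by n - 1

print(f"count={len(run_times)} min={min(run_times)} max={max(run_times)}")
print(f"mean={mean_rt:.4f}")
print(f"population std={population_std:.4f}")
print(f"sample std={sample_std:.4f}")
```

The single completed trial (467 s) dominates both the mean and the spread; the eight failed trials all ended after 30 to 43 seconds.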
Show SLURM-Job-ID (if it exists)
submitit INFO (2025-07-31 17:01:14,732) - Starting with JobEnvironment(job_id=531227, hostname=c152, local_rank=0(1), node=0(1), global_rank=0(1))
submitit INFO (2025-07-31 17:01:14,732) - Loading pickle: /data/cat/ws/pwinkler-mnist_tst/omniopt/runs/mnist_gpu_noall/3/single_runs/531227/531227_submitted.pkl