(base) [root@localhost ~]# docker exec finetune-trainer bash -c 'tail -n 20 $(ls -t /tmp/train_*.log | head -1)'
[remote_train]   Preprocessing done, output: /root/Fine-tuning/backend/data/processed/fb18c7a8-e275-4014-b6a3-dea08f3f7adb_processed.jsonl
[remote_train] Step 2: Loading model: Qwen/Qwen1.5-0.5B...
[remote_train]   Quantization: None
Loading weights: 100%|██████████| 291/291 [00:04<00:00, 59.89it/s] 
[remote_train]   Model loaded successfully
[remote_train] Step 3: Building PEFT config...
[remote_train]   PEFT config built
[remote_train] Step 4: Starting training...
Map: 100%|██████████| 274147/274147 [00:15<00:00, 18259.13 examples/s]
/opt/conda/lib/python3.10/site-packages/peft/tuners/tuners_utils.py:1348: UserWarning: Model has `tie_word_embeddings=True` and a tied layer is part of the adapter, but `ensure_weight_tying` is not set to True. This can lead to complications, for example when merging the adapter or converting your model to formats other than safetensors. Check the discussion here: https://github.com/huggingface/peft/issues/2777
  warnings.warn(msg)
[transformers] warmup_ratio is deprecated and will be removed in v5.2. Use `warmup_steps` instead.
/opt/conda/lib/python3.10/site-packages/torchvision/datapoints/__init__.py:12: UserWarning: The torchvision.datapoints and torchvision.transforms.v2 namespaces are still Beta. While we do not expect major breaking changes, some APIs may still change according to user feedback. Please submit any feedback you may have in this issue: https://github.com/pytorch/vision/issues/6753, and you can also check out https://github.com/pytorch/vision/issues/7319 to learn more about the APIs that we suspect might involve future changes. You can silence this warning by calling torchvision.disable_beta_transforms_warning().
  warnings.warn(_BETA_TRANSFORMS_WARNING)
/opt/conda/lib/python3.10/site-packages/torchvision/transforms/v2/__init__.py:54: UserWarning: The torchvision.datapoints and torchvision.transforms.v2 namespaces are still Beta. While we do not expect major breaking changes, some APIs may still change according to user feedback. Please submit any feedback you may have in this issue: https://github.com/pytorch/vision/issues/6753, and you can also check out https://github.com/pytorch/vision/issues/7319 to learn more about the APIs that we suspect might involve future changes. You can silence this warning by calling torchvision.disable_beta_transforms_warning().
  warnings.warn(_BETA_TRANSFORMS_WARNING)
trainable params: 5,593,088 || all params: 469,580,800 || trainable%: 1.1911
  0%|          | 0/4284 [00:00<?, ?it/s]/opt/conda/lib/python3.10/site-packages/transformers/models/qwen2/modeling_qwen2.py:108: UserWarning: Attempting to run cuBLAS, but there was no current CUDA context! Attempting to set the primary context... (Triggered internally at /workspace/framework/mcPytorch/aten/src/ATen/cuda/CublasHandlePool.cpp:183.)
  freqs = (inv_freq_expanded.float() @ position_ids_expanded.float()).transpose(1, 2)
 19%|█▉        | 812/4284 [21:54<1:34:25,  1.63s/it](base) [root@localhost ~]# 
