diff --git a/README.md b/README.md
index 1db135a..927b36b 100644
--- a/README.md
+++ b/README.md
@@ -157,6 +157,8 @@ Confirming that 30B model is able to generate SQL code: https://github.com/randa
 
 ## Hugging Face 🤗 version (inference & training)
 
+### Inference
+
 Thanks to Yam Peleg, we now have *"No overengineering bullshit"* version.
 
 You do not need to download torrent or merge weights, as model shards and tokenizer will be downloaded from HF automatically at the first run. They will be cached in [C:\Users\USERNAME\\.cache\huggingface\hub] folder under Windows, so do not forget to clean up to 250 Gb after experiments.