I used StandardScaler on the entire dataset before splitting, and later realized that might be wrong. How should I handle scaling correctly to avoid leakage?
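A minimal sketch of one common way to do this, assuming scikit-learn and a plain train/test split: split first, fit the scaler on the training rows only, then reuse those training statistics to transform the test rows. The synthetic `X` and `y` below are stand-ins for illustration, not data from the question.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (an assumption; replace with your own X, y).
rng = np.random.default_rng(0)
X = rng.normal(loc=5.0, scale=2.0, size=(200, 3))
y = rng.integers(0, 2, size=200)

# 1. Split first, so the test rows never influence the scaler.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# 2. Fit the scaler on the training rows only.
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)

# 3. Reuse the training mean and std for the test rows (transform, not fit).
X_test_scaled = scaler.transform(X_test)
```

The scaled test features will usually not have exactly zero mean and unit variance, which is expected: they are scaled with statistics the test set did not contribute to. If you cross-validate, putting the scaler and model in a `sklearn.pipeline.Pipeline` should get the same per-fold behaviour automatically.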
I’m seeing 98–99% accuracy on my validation set, but when I test on truly unseen data, performance drops significantly. I suspect some leakage, but I’m not sure where it’s happening.
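One possible first check, and only a guess at the cause here, is whether identical rows ended up in both the training and validation splits. The sketch below uses plain Python; `shared_rows` and the toy data are hypothetical names made up for illustration, and it only catches exact duplicates, not near-duplicates or leakage through preprocessing or target-derived features.

```python
def shared_rows(train_rows, val_rows):
    """Return the validation rows that also appear in train (exact matches only)."""
    train_set = {tuple(row) for row in train_rows}
    return [tuple(row) for row in val_rows if tuple(row) in train_set]


# Toy data (hypothetical); swap in your own feature rows.
train = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
val = [[3.0, 4.0], [7.0, 8.0]]

overlap = shared_rows(train, val)
print(f"{len(overlap)} of {len(val)} validation rows also appear in train")
```

If the overlap is zero, the leak is likely somewhere else, for example in preprocessing fitted on the full dataset or in a feature that encodes the target.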