Skip to main content

Command Palette

Search for a command to run...

#71 Machine Learning & Data Science Challenge 71

Published
1 min read
#71 Machine Learning & Data Science Challenge 71
B

Greetings.

I am a machine learning engineer based in India, possessing a sustained interest in machine learning since my undergraduate studies. I have completed Stanford University's machine learning course (Andrew Ng) via Coursera, and IBM's machine learning and deep learning curriculum. My current focus is on machine learning and data science projects, aiming to leverage my expertise for impactful, real-world problem-solving.

List down hyperparameter tuning in Deep Learning.

The process of setting the hyper-parameters requires expertise and extensive trial and error.

  • There are no simple and easy ways to set hyper-parameters — specifically, learning rate, batch size, momentum, and weight decay.

Approaches to searching for the best configuration:

• Grid Search

• Random Search

Approach:

  1. Observe and understand the clues available during training by monitoring validation/test loss early in training, and tune your architecture and hyper-parameters with short runs of a few epochs.

  2. Signs of underfitting or overfitting of the test or validation loss early in the training process are useful for tuning the hyperparameters.

Tools for Optimizing Hyperparameters:

• Sage Maker

• Comet.ml

• Weights &Biases

• Deep Cognition

• Azure ML

More from this blog