Yes, gradient descent is way more important for machine learning, and also if you have a lot of parameters the matrices get too big to efficiently invert them and you are better of with gradient descent. When I started looking into ML I was very confused that linear regression is even considered machine learning. But you can explain a lot of important things with it like gradient descent, supervised learning over/underfitting etc.