How to evaluate large models?

Read it; quite useful.

See how others do it

Read it; mostly nonsense.