Agreed, and I would love that sort of info in a standardized way. If the reviewers use different cells then total runtime can be misleading.

Say I buy a flashlight that is tested as having good efficiency, but it only has 3 modes and I want 5 modes. I could replace the stock driver with a driver that is extremely configurable. But I doubt I would do so if it turns out the new driver is 20% less efficient.

And yeah, I do understand testing can be difficult when thermal regulation kicks in on those super bright turbo modes.
Personally I need far less brightness. So tests at 300, 600, 1000 lumen would be what I would like to see. Those brightness levels often can be sustained for a long time.
Maybe the flashlight has a 2500lm turbo that steps down after a minute. Because that’s so short I don’t really care about that efficiency.