A confident label is not the same as a strong study. A supplement can be marketed hard, sold everywhere, and backed by very little, while another with genuine trial evidence sits quietly next to it for the same price. "Evidence-graded" just means someone has done the work of telling those two apart and shown you the grade, which is the column missing from the back of every supplement bottle.
Here is how the grading works, and the part that matters most: the list of things that do not make the cut.
Four grades, plainly
Grade A means several good randomised trials, ideally pulled together in a meta-analysis, pointing the same way, with a safety profile that holds up at the dose being suggested. Grade B means the evidence is real but thinner: fewer trials, smaller effects, or some inconsistency. Grade C means the idea is promising but early, and anything in this band should carry an honest "emerging" flag rather than a confident claim. Grade D means the evidence is weak, absent, or outweighed by a safety problem. Grade D things are never recommended.
The part worth reading: what gets rejected, and why
A grading system is only as honest as the list it rules out. Three examples show why a compound lands on the reject pile, and none of them is simply "it does not work".
Kava
Kava has a defensible case for easing anxiety. It also has documented reports of severe liver injury, serious enough that several national regulators have restricted it. When a safer option exists for a similar effect, the trade-off does not justify an unsupervised over-the-counter product. That is a safety call, not an efficacy one.
St John's Wort
St John's Wort genuinely helps mild-to-moderate low mood. The problem is what it does to everything else: it speeds up a liver enzyme that clears around half of all prescription medicines, which can quietly lower the level of the contraceptive pill, blood thinners, transplant drugs and more. It is on the reject list not because it is useless but because it is too disruptive to sit in an unsupervised supplement.
High-dose beta-carotene, in smokers
This one shows why context changes the grade. In non-smokers, beta-carotene is unremarkable. In two large trials of smokers, the supplemented group had more lung cancer, not less, and one trial was stopped early. The same compound can be acceptable in one population and genuinely harmful in another. The grade depends on who is taking it.
Why the reject list is the point
In a lot of cases the "considered but not recommended" list is longer than the recommendation list. That asymmetry is the honest version of the job. A grade is meaningful in proportion to the things it is willing to rule out, which is also why the interactions behind a medicine sit alongside the evidence when a supplement is judged. And for some nutrients the more useful question is not the grade at all, but whether you needed the supplement in the first place.