> As of 1 Oct 2024, this model is #1 on all three automatic alignment benchmarks (verified tab for AlpacaEval 2 LC), edging out strong frontier models such as GPT-4o and Claude 3.5 Sonnet.
> this model can correctly the question How many r in strawberry? without specialized prompting or additional reasoning tokens
> As of 1 Oct 2024, this model is #1 on all three automatic alignment benchmarks (verified tab for AlpacaEval 2 LC), edging out strong frontier models such as GPT-4o and Claude 3.5 Sonnet.
> this model can correctly the question How many r in strawberry? without specialized prompting or additional reasoning tokens
Can be tested here: https://huggingface.co/chat/models/nvidia/Llama-3.1-Nemotron...