Eval JavaScript - Search News

Developer-targeting campaign using malicious Next.js repositories

A developer-targeting campaign leveraged malicious Next.js repositories to trigger a covert RCE-to-C2 chain through standard ...

IEEE

CAST-Eval: A Domain-Specific Benchmark for Large Language Models in Civil Aviation Safety

Abstract: In this paper, we present CAST-Eval, a novel, comprehensive and domain-specific benchmark designed to assess the knowledge and reasoning capabilities of large language models (LLMs) in the ...

GitHub

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

If you have any questions about the code or the paper, feel free to contact Zhiyuan Zeng ([email protected] or [email protected]). If you encounter any issues while using the code or want ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Developer-targeting campaign using malicious Next.js repositories

CAST-Eval: A Domain-Specific Benchmark for Large Language Models in Civil Aviation Safety

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

Trending now