Popular repositories
- LLMTest_FindTheOrigin (Public): Testing reasoning degradation in LLMs with variable context windows and information organization. Python, 2 stars.
- evals (Public, forked from openai/evals): Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. Python.
- LLM_AdditionalTests_LongPrompts (Public): Repository for the extended versions of the prompts used in the research paper "Challenging LLMs Beyond Information Retrieval: Reasoning Degradation with Long Context Windows."
- Reasoning-Degradation_Paper (Public): Full content of the paper "Challenging Large Language Models (LLMs) Beyond Information Retrieval: Reasoning Degradation with Long Context Windows."