Skip to content

Commit 7529a22

Browse files
committed
Small fixes for EMNLP paper
1 parent d7d9db1 commit 7529a22

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

index.html

+3-3
Original file line numberDiff line numberDiff line change
@@ -131,8 +131,8 @@
131131
<br>
132132
<p>This website is a work in progress. Please see my <a href="https://scholar.google.com/citations?hl=en&user=tc0M7WwAAAAJ&view_op=list_works&sortby=pubdate">Google Scholar</a> for a full, more up-to-date list of my publications!</p>
133133

134-
<div class="row content-summary pt-4 pb-2"></div>
135-
134+
<div class="row content-summary pt-4 pb-2">
135+
136136
<div class="d-none d-sm-block col-sm-3 m-0 p-0">
137137

138138
<img src=/media/papers/prob_reasoning_in_LLMs/prob_reasoning_in_LLMs.png class="img-fluid summary-image drop-shadow" alt="teaser img">
@@ -146,7 +146,7 @@
146146
EMNLP 2024 (Main)
147147
</p>
148148

149-
<p class="summary-text">Probabilistic reasoning is a key challenge for large language models (LLMs) that requires understanding and interpreting numerical data across distributions. In our paper, we systematically evaluate LLMs on three core tasks—percentile estimation, sampling, and probability calculation—across both real-world and idealized distributions. By incorporating techniques such as within-distribution anchoring, real-world context, and simplifying assumptions (e.g., Normal approximations), we demonstrate performance improvements of up to 70% over baseline methods. We will release our benchmark dataset to encourage further development of the reasoning capabilities of LLMs, allowing them to become more useful, safer, and more reliable.</p>
149+
<p class="summary-text">Probabilistic reasoning is a key challenge for large language models (LLMs). Our paper evaluates LLMs on three tasks of estimating percentiles, drawing samples, and calculating probabilties using real-world and idealized distributions. Techniques such as within-distribution anchoring, real-world context, and simplifying assumptions (e.g., Normal approximations) improved performance by up to 70%.</p>
150150

151151
<div class="d-flex flex-row flex-wrap">
152152

0 commit comments

Comments
 (0)