
This repository provides HAWP: a dataset for Hindi Word Problem Solving and a baseline (LREC 2022)

hellomasaya/hawp

Hindi Arithmetic Word Problems - HAWP

This repository provides HAWP, the first diverse dataset of Hindi math word problems, for developing and evaluating MWP solvers for Hindi. It contains 2336 Hindi math word problems (MWPs) and was published in the paper "HAWP: a Dataset for Hindi Arithmetic Word Problem Solving".

The dataset can be found in hawp/dataset/

About the dataset

| Field | Value |
|---|---|
| Name | HAWP |
| Version | v1.0 |
| Resource Type | Corpus |
| Language Type | Monolingual |
| Language | Hindi |
| Size | 2336 MWPs |
| Modality | Written |
| Annotation | Equation, Relevant Indices, Number of Operators |
| Use | For Automatic Word Problem Solving |
| Format | JSON |

A sample entry from the dataset:

{
    "pIndex": 0,
    "Problem": "सोहन ने 7.423 की.ग्रा. चावल तथा 6.129 की.ग्रा. दाल खरीदी। सोहन ने कुल कितना सामान खरीदा?",
    "Equation": "X = ( 7.423 + 6.129 )",
    "Relevant Indices": [
        0,
        1
    ],
    "Number of Operators": 1
}
  • pIndex: A unique identifier for an MWP.
  • Problem: The word problem text (the statements and clues needed to solve the MWP) followed by the question to be answered.
  • Equation: The equation used to solve the MWP.
  • Relevant Indices: List of indices of the quantities in the MWP that are relevant to answering the question. Whenever the MWP requires world knowledge or uses implicit quantities, the list contains the implicit keyword.
  • Number of Operators: Number of operations required to solve the MWP.
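The fields described above can be cross-checked against each other. The following is a minimal sketch in Python, not part of the repository's baseline: it parses the sample entry, extracts the quantities from the problem text, and evaluates the equation. The helper names (`quantities`, `relevant_quantities`, `solve`, `count_operators`) are hypothetical. It assumes that indices are zero-based positions of the numbers in order of appearance, that an implicit entry appears as a non-integer value in the list, and that equations use only `+`, `-`, `*`, `/` with a single unknown on the left-hand side. The sample problem reads roughly "Sohan bought 7.423 kg of rice and 6.129 kg of lentils. How much did Sohan buy in total?"

```python
import ast
import json
import operator
import re

# The sample entry shown above, embedded so the sketch is self-contained.
SAMPLE = json.loads("""
{
    "pIndex": 0,
    "Problem": "सोहन ने 7.423 की.ग्रा. चावल तथा 6.129 की.ग्रा. दाल खरीदी। सोहन ने कुल कितना सामान खरीदा?",
    "Equation": "X = ( 7.423 + 6.129 )",
    "Relevant Indices": [0, 1],
    "Number of Operators": 1
}
""")

# Assumption: only the four basic arithmetic operators occur in equations.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def quantities(problem):
    """Return the numbers in the problem text, in order of appearance."""
    return [float(tok) for tok in re.findall(r"[0-9]+(?:\.[0-9]+)?", problem)]


def relevant_quantities(record):
    """Pick the quantities named by "Relevant Indices".

    Assumption: non-integer entries (such as an implicit marker) are skipped.
    """
    nums = quantities(record["Problem"])
    return [nums[i] for i in record["Relevant Indices"] if isinstance(i, int)]


def _eval(node):
    """Evaluate a parsed arithmetic expression without calling eval()."""
    if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
    if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
        return -_eval(node.operand)
    raise ValueError("unsupported expression in equation")


def _rhs_tree(equation):
    """Parse the right-hand side of an equation of the form "X = <expr>"."""
    rhs = equation.split("=", 1)[1].strip()
    return ast.parse(rhs, mode="eval")


def solve(equation):
    """Compute the value of the unknown on the left-hand side."""
    return _eval(_rhs_tree(equation).body)


def count_operators(equation):
    """Count the binary operators on the right-hand side."""
    return sum(isinstance(n, ast.BinOp) for n in ast.walk(_rhs_tree(equation)))


if __name__ == "__main__":
    print(relevant_quantities(SAMPLE))
    print(solve(SAMPLE["Equation"]))
    print(count_operators(SAMPLE["Equation"]) == SAMPLE["Number of Operators"])
```

For the sample entry, both extracted quantities are relevant, the equation evaluates to approximately 13.552, and the operator count agrees with the annotated value of 1. Walking the parsed expression tree rather than calling `eval` keeps the check restricted to plain arithmetic.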
