└── README.md
/README.md:
--------------------------------------------------------------------------------
1 | ## Xlit-Crowd: Hindi-English Transliteration Corpus
2 |
3 | The corpus contains Hindi-English transliteration pairs, obtained via crowdsourcing by asking workers to transliterate Hindi words into the Roman script. The tasks were run on Amazon Mechanical Turk and yielded a total of 14,919 pairs.
4 |
5 | The dataset is described in detail in the following paper. Please cite it if you use this dataset for research:
6 |
7 | Mitesh M. Khapra, Ananthakrishnan Ramanathan, Anoop Kunchukuttan, Karthik Visweswariah, Pushpak Bhattacharyya. __When Transliteration Met Crowdsourcing: An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control__. Language Resources and Evaluation Conference (LREC 2014). 2014.
8 |
9 | ### License
10 |
11 | Xlit-Crowd: Hindi-English Transliteration Corpus by Mitesh Khapra is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
12 |
--------------------------------------------------------------------------------