├── .gitattributes ├── .yo-rc.json ├── contributing.md ├── code-of-conduct.md └── readme.md /.gitattributes: -------------------------------------------------------------------------------- 1 | * text=auto 2 | readme.md merge=union 3 | -------------------------------------------------------------------------------- /.yo-rc.json: -------------------------------------------------------------------------------- 1 | { 2 | "generator-awesome-list": { 3 | "promptValues": { 4 | "email": "yusuf@quiec.tech" 5 | } 6 | } 7 | } 8 | -------------------------------------------------------------------------------- /contributing.md: -------------------------------------------------------------------------------- 1 | # Contribution Guidelines 2 | 3 | Please note that this project is released with a 4 | [Contributor Code of Conduct](code-of-conduct.md). By participating in this 5 | project you agree to abide by its terms. 6 | 7 | --- 8 | 9 | Ensure your pull request adheres to the following guidelines: 10 | 11 | - Make sure you take care of this 12 | - And this as well 13 | - And don't forget to check this 14 | 15 | Thank you for your suggestions! 16 | 17 | 18 | ## Updating your PR 19 | 20 | A lot of times, making a PR adhere to the standards above can be difficult. 21 | If the maintainers notice anything that we'd like changed, we'll ask you to 22 | edit your PR before we merge it. There's no need to open a new PR, just edit 23 | the existing one. If you're not sure how to do that, 24 | [here is a guide](https://github.com/RichardLitt/knowledge/blob/master/github/amending-a-commit-guide.md) 25 | on the different ways you can update your PR so that we can merge it. 26 | -------------------------------------------------------------------------------- /code-of-conduct.md: -------------------------------------------------------------------------------- 1 | # Contributor Covenant Code of Conduct 2 | 3 | ## Our Pledge 4 | 5 | In the interest of fostering an open and welcoming environment, we as 6 | contributors and maintainers pledge to making participation in our project and 7 | our community a harassment-free experience for everyone, regardless of age, body 8 | size, disability, ethnicity, gender identity and expression, level of experience, 9 | nationality, personal appearance, race, religion, or sexual identity and 10 | orientation. 11 | 12 | ## Our Standards 13 | 14 | Examples of behavior that contributes to creating a positive environment 15 | include: 16 | 17 | * Using welcoming and inclusive language 18 | * Being respectful of differing viewpoints and experiences 19 | * Gracefully accepting constructive criticism 20 | * Focusing on what is best for the community 21 | * Showing empathy towards other community members 22 | 23 | Examples of unacceptable behavior by participants include: 24 | 25 | * The use of sexualized language or imagery and unwelcome sexual attention or 26 | advances 27 | * Trolling, insulting/derogatory comments, and personal or political attacks 28 | * Public or private harassment 29 | * Publishing others' private information, such as a physical or electronic 30 | address, without explicit permission 31 | * Other conduct which could reasonably be considered inappropriate in a 32 | professional setting 33 | 34 | ## Our Responsibilities 35 | 36 | Project maintainers are responsible for clarifying the standards of acceptable 37 | behavior and are expected to take appropriate and fair corrective action in 38 | response to any instances of unacceptable behavior. 39 | 40 | Project maintainers have the right and responsibility to remove, edit, or 41 | reject comments, commits, code, wiki edits, issues, and other contributions 42 | that are not aligned to this Code of Conduct, or to ban temporarily or 43 | permanently any contributor for other behaviors that they deem inappropriate, 44 | threatening, offensive, or harmful. 45 | 46 | ## Scope 47 | 48 | This Code of Conduct applies both within project spaces and in public spaces 49 | when an individual is representing the project or its community. Examples of 50 | representing a project or community include using an official project e-mail 51 | address, posting via an official social media account, or acting as an appointed 52 | representative at an online or offline event. Representation of a project may be 53 | further defined and clarified by project maintainers. 54 | 55 | ## Enforcement 56 | 57 | Instances of abusive, harassing, or otherwise unacceptable behavior may be 58 | reported by contacting the project team at yusuf@quiec.tech. All 59 | complaints will be reviewed and investigated and will result in a response that 60 | is deemed necessary and appropriate to the circumstances. The project team is 61 | obligated to maintain confidentiality with regard to the reporter of an incident. 62 | Further details of specific enforcement policies may be posted separately. 63 | 64 | Project maintainers who do not follow or enforce the Code of Conduct in good 65 | faith may face temporary or permanent repercussions as determined by other 66 | members of the project's leadership. 67 | 68 | ## Attribution 69 | 70 | This Code of Conduct is adapted from the [Contributor Covenant][homepage], version 1.4, 71 | available at [http://contributor-covenant.org/version/1/4][version] 72 | 73 | [homepage]: http://contributor-covenant.org 74 | [version]: http://contributor-covenant.org/version/1/4/ 75 | -------------------------------------------------------------------------------- /readme.md: -------------------------------------------------------------------------------- 1 | # Awesome Turkish NLP [![Awesome](https://awesome.re/badge.svg)](https://awesome.re) 2 | 3 | > A curated list of awesome Turkish language processing libraries, models, resources and datasets. 4 | The main focus is on **open source** tools, **downloadable** data and research **papers with code**. 5 | 6 | ## Contents 7 | - [Libraries](#Libraries) 8 | - [Java](#java) 9 | - [C#](#c) 10 | - [Python](#python) 11 | - [Haskell](#haskell) 12 | - [PHP](#php) 13 | 14 | - [Datasets](#datasets) 15 | - [Models](#models) 16 | - [Resources](#resources) 17 | - [Articles](#articles) 18 | - [Videos](#videos) 19 | - [Courses](#courses) 20 | - [Books](#books) 21 | - [Contribute](#contribute) 22 | 23 | ## Libraries 24 | ### Java 25 | - [ahmetaa/zemberek-nlp](https://github.com/ahmetaa/zemberek-nlp) - The most known Turkish NLP tool. 26 | - [cbilgili/zemberek-nlp-server](https://github.com/cbilgili/zemberek-nlp-server) - REST API for Zemberek. 27 | 28 | ### C# 29 | - [hrzafer/nuve](https://github.com/hrzafer/nuve) - Natural Language Processing Library for Turkish in C# 30 | - [JnRMnT/ZemberekDotNet](https://github.com/JnRMnT/ZemberekDotNet) - dotNet port of Zemberek. 31 | - [yasinkuyu/Turkish.cs](https://github.com/yasinkuyu/Turkish.cs) - An C# tool for create Turkish Words. 32 | 33 | ### Python 34 | - [brolin59/trnlp](https://github.com/brolin59/trnlp) - Some tools for Turkish NLP. 35 | - [akoksal/Turkce-Duygu-Takibi](https://github.com/akoksal/Turkce-Duygu-Takibi) - Sentiment analysis tool for Twitter Turkey. 36 | - [MeteHanC/turkishnlp](https://github.com/MeteHanC/turkishnlp) - Basic but good Turkish NLP tools. 37 | - [Loodos/zemberek-python](https://github.com/Loodos/zemberek-python) - Python port of Zemberek. 38 | - [otuncelli/turkish-stemmer-python](https://github.com/otuncelli/turkish-stemmer-python) - Turkish Stemmer for Python. 39 | - [deeplearningturkiye/kelime_kok_ayirici](https://github.com/deeplearningturkiye/kelime_kok_ayirici) - Turkish Stemmer for Python with DeepLearning. 40 | - [emres/turkish-deasciifier](https://github.com/emres/turkish-deasciifier) - Turkish Deasciifier. 41 | - [google-research/turkish-morphology](https://github.com/google-research/turkish-morphology) - A two-level morphological analyzer for Turkish. 42 | - [GlobalMaksimum/sadedegel](https://github.com/GlobalMaksimum/sadedegel) - Turkish news summarizer. 43 | - [Fixy-TR/fixy](https://github.com/Fixy-TR/fixy) - A library that can analyze emotion and correct spelling according to some rules in Turkish. 44 | - [ardauzunoglu/rte-speech-generator](https://github.com/ardauzunoglu/rte-speech-generator) - Natural Language Processing to generate new speeches for the President of Turkey. 45 | - [akoksal/Turkish-Word2Vec](https://github.com/akoksal/Turkish-Word2Vec) - Pre-trained Word2Vec Model for Turkish. 46 | - [arcanteus/formally](https://github.com/arcanteus/formally) - Turkish NLP Formal/Informal Classification System. 47 | 48 | ### Haskell 49 | - [joom/Guguk](https://github.com/joom/Guguk) - Turkish NLP library for Haskell. 50 | - [joom/dilacar](https://github.com/joom/dilacar) - A rule-based machine translation system from Ottoman Turkish to Modern Turkish. 51 | 52 | ### PHP 53 | - [Quiec/DeAsciifier](https://github.com/Quiec/DeAsciifier) - Turkish Deasciifier. 54 | 55 | ## Datasets 56 | - [Turkish Wikipedia Dump](https://www.kaggle.com/mustfkeskin/turkish-wikipedia-dump) 57 | - [Turkish Positive/Negative/Notr Tweets](https://github.com/sercankulcu/twitterdata) 58 | - [Turkish Offensive Tweets](https://coltekin.github.io/offensive-turkish/) 59 | 60 | ## Models 61 | - [BERTurk](https://github.com/stefan-it/turkish-bert) - Turkish BERT/DistilBERT and ELECTRA models 62 | - [Turkish Language Models](https://github.com/Loodos/turkish-language-models) - Some Trained Models 63 | - [BERTurk-Social](https://huggingface.co/ibahadiraltun/berturk-social) - Model created using social media. 64 | - [kuzgunlar/electra-turkish-ner](https://huggingface.co/kuzgunlar/electra-turkish-ner) 65 | - [dbmdz/bert-base-turkish-cased](https://huggingface.co/dbmdz/bert-base-turkish-cased) 66 | - [savasy/bert-base-turkish-sentiment-cased](https://huggingface.co/savasy/bert-base-turkish-sentiment-cased) 67 | - [More Models](https://huggingface.co/models?search=turkish) 68 | 69 | ## Resources 70 | - [ITU NLP list](http://ddi.itu.edu.tr/en/toolsandresources) 71 | 72 | ### Articles 73 | - [Two-level Description of Turkish Morphology](http://repository.bilkent.edu.tr/bitstream/handle/11693/25941/Two_level_description_of_Turkish_Morphology.pdf?sequence=1) 74 | - [Doğal Dil İşleme İle Türkçe Yazım Hatalarının Denetlenmesi](https://polen.itu.edu.tr/handle/11527/321) 75 | - [Türkçe ve Doğal Dil İşleme](https://dergipark.org.tr/en/pub/tbbmd/issue/22245/238795) 76 | - [Doğal Dil İşleme](https://dergipark.org.tr/en/pub/tbbmd/issue/22245/238797) 77 | - [TÜRKÇE İÇİN DERİN ÖĞRENME TABANLI DOĞAL DİL İŞLEME MODELİ GELİŞTİRİLMESİ](https://researchgate.net/profile/Adem_Tekerek/publication/332912743_DEVELOPMENT_OF_DEEP_LEARNING_BASED_NATURAL_LANGUAGE_PROCESSING_MODEL_FOR_TURKISH/links/5ddcc338a6fdccdb44656f6e/DEVELOPMENT-OF-DEEP-LEARNING-BASED-NATURAL-LANGUAGE-PROCESSING-MODEL-FOR-TURKISH.pdf) 78 | - [TÜRKÇE OTOMATİK HECELEME SİSTEMİ VE HECE İSTATİSTİKLERİ](https://akademik.adu.edu.tr/bolum/fef/matematik/webfolders/file/personel/a1005/195.pdf) 79 | - [Türkçe Haber Metinlerinin Konvolüsyonel Sinir Ağları ve Word2Vec Kullanılarak Sınıflandırılması](https://dergipark.org.tr/en/pub/gazibtd/issue/47484/457917) 80 | - [LSTM Ağları ile Türkçe Kök Bulma](https://dergipark.org.tr/tr/download/article-file/775805) 81 | 82 | ### Videos 83 | - [Uygulamalı Türkçe doğal dil işleme evreleri](https://www.youtube.com/watch?v=KoQ-_rp07Rg) 84 | - [Türkçe Doğal Dil İşleme - Kemal Oflazer](https://www.youtube.com/watch?v=CzIUICJXAjY) 85 | 86 | ### Courses 87 | - [Doğal Dil İşleme A-Z](https://www.udemy.com/course/dogal-dil-isleme/) 88 | 89 | ### Books 90 | - []() 91 | 92 | ## Contribute 93 | If you want to contribute to this list (please do), read the [contribution guidelines](contributing.md) first then send me a pull request. 94 | Also, a listed repository should be tagged as deprecated if: 95 | 96 | * Repository's owners explicitly say that "this library is not maintained". 97 | * Not committed for long time (2~3 years). 98 | 99 | ## Credits 100 | This list created by [Yusuf Usta](https://github.com/quiec) with ❤️ --------------------------------------------------------------------------------