├── CFP-One.md ├── CFP-Tweets.md ├── Figure-Out-OpenReview.md ├── LICENSE ├── PC-Invite-Template.md ├── PC-Preworkshop-Email-NoReplyFollowUp.md ├── PC-Preworkshop-Email.md ├── PC-Reinvite-Template.md ├── Proposal-2023.md ├── README.md ├── Reviewing-Period-Start.md ├── Softconf-Setup.md ├── Speaker-Invite-1.md ├── Speaker-Invite-2.md ├── Speaker-Invite-3.md ├── Speaker-Invite-OAI.md ├── Sponsorship-Invite-Template.md ├── decisions.md ├── nlposs-2018 ├── CFP-2018.md ├── Proposal-2018.md └── schedule-2018.orderfile ├── nlposs-2020 ├── CPF-One.md ├── CPF-two.md ├── PC-Invite-Template.md ├── PC-Reinvite-Template.md ├── PC-Updates-1.md ├── PC-Updates-2.md ├── PC-reInvite-noreply-follow-up.md ├── PC-reInvite-noreply-followup-lastcheck-finalthankyou.md ├── PC-reInvite-noreply-followup-lastcheck.md ├── Proposal-2020.md ├── Speaker-Invite-1.md ├── Speaker-Invite-2.md ├── Speaker-Invite-3.md ├── Speaker-Invite-4-5.md └── Speaker-Invite-6-7-8.md ├── nlposs-2023-admin-1 ├── README.md ├── papers_workshop_nlposs.yml └── program_workshop_nlposs.yml └── nlposs-2023 ├── 04-calamanCy-Poster.pdf ├── 04-calamanCy-Presentation.pdf ├── 05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Poster.pdf ├── 05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Slide.pdf ├── 06-Deepparse EMNLP.pdf ├── 06-Deepparse___An_Extendable__and_Fine_Tunable_State_Of_The_Art_Library_for_Parsing_Multinational_Street_Addresses_Poster.pdf ├── 08-PyThaiNLP-Poster.pdf ├── 08-PyThaiNLP-Slide.pdf ├── 09-Empowering_KD_from_SL_RAA_Poster.pdf ├── 09-Empowering_KD_from_SL_RAA_lightning_slide.pdf ├── 10-lgrobol-zeldarose.pdf ├── 11-gpt4all_lightning_slide.png ├── 11-gpt4all_poster.png ├── 12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-poster.pdf ├── 12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-slide.pdf ├── 13_gector_case_study_poster.pdf ├── 13_gector_case_study_ppt.pdf ├── 16-EDGAR_CRAWLER_poster.png ├── 16-EDGAR_CRAWLER_slide_lightning_presentation.pdf ├── 17-ACL-Anthology-Poster.pdf ├── 17-ACL-Anthology-Slide.pdf ├── 18-nanoT5-Poster.pdf ├── 18-nanoT5-Slide.pdf ├── 19_AWAREText_lightning_talk_slide.pdf ├── 19_AWAREText_poster.pdf ├── 20-sotastream-poster.pdf ├── 20-sotastream-slide.pdf ├── 21_LiFE_Poster.pdf ├── 21_LiFE_Slide.pdf ├── 22-rumour-detection-in-the-wild-poster.pdf ├── 22-rumour-detection-in-the-wild-slide.pdf ├── 23-LaTeX-Rainbow-Poster.pdf ├── 23-LaTeX-Rainbow-Slide.pdf ├── 24-LP-DeepZensolsLightingPres.pdf ├── 24-Poster-DeepZensolsPoster.pdf ├── 25-SeqScore-Poster.pdf ├── 25-SeqScore-Slide.pdf ├── 26-torchdistill_nlp-oss2023_poster.pdf ├── 26-torchdistill_nlp-oss2023_slide.pdf ├── 27-captum-poster.pdf ├── 27-captum-slide.pdf ├── 28-nerblackbox-poster.pdf ├── 28-nerblackbox-presentation.pdf ├── 29-NewsSignals-lightning-slide.pdf ├── 29-NewsSignals-poster.pdf ├── 30-PyTAIL-Poster.pdf ├── 30-PyTAIL-Slide.pdf ├── 32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Poster.pdf ├── 32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Slide.pdf ├── 33-GPTCache-poster.pdf ├── 33-GPTCache-slider.pdf ├── Check-posters.ipynb ├── Findings-The_Vault_23_10.pdf ├── Findings-The_Vault_23_10_Poster.pdf ├── Findings-The_Vault_23_10_Slide.pdf ├── README.md ├── lightning-session-1.pdf ├── lightning-slides-2-compressed.pdf └── lightning-slides-2.pdf /CFP-One.md: -------------------------------------------------------------------------------- 1 | (apologies for cross-posting) 2 | 3 | ---------------------------------------------------------------- 4 | 5 | *Workshop for NLP Open Source Software (NLP-OSS)* 6 | 7 | 06 Dec 2023, Co-located with EMNLP 2023 8 | 9 | https://nlposs.github.io/ 10 | 11 | Deadline for Long and Short Paper submission: 09 August, 2023 12 | (23:59, GMT-11) 13 | 14 | ---------------------------------------------------------------- 15 | 16 | You have tried to use the latest, bestest, fastest LLM models and bore grievances but found the solution after hours of coffee and computer staring. Share that at NLP-OSS and suggest how open source could change for the better (e.g. best practices, documentation, API design etc.) 17 | 18 | You came across an awesome SOTA system on NLP task X that no LLM has beaten its F1 score. But now the code is stale and it takes a dinosaur to understand the code. Share your experience at NLP-OSS and propose how to "replicate" these forgotten systems. 19 | 20 | You see this shiny GPT from a blog post, tried it to reproduce similar results on a different task and it just doesn't work on your dataset. You did some magic to the code and now it works. Show us how you did it! Though they're small tweaks, well-motivated and empirically tested are valid submissions to NLP-OSS. 21 | 22 | You have tried 101 NLP tools and there's none that really do what you want. So you wrote your own shiny new package and made it open source. Tell us why your package is better than the existing tools. How did you design the code? Is it going to be a one-time thing? Or would you like to see thousands of people using it? 23 | 24 | You have heard enough of open-source LLM and pseudo-open-source GPT but not enough about how it can be used for your use-case or your commercial product at scale. So you contacted your legal department and they explained to you about how data, model and code licenses work. Sharing the knowledge with the NLP-OSS community. 25 | 26 | You have a position/opinion to share about free vs open vs closed source LLMs and have valid arguments, references or survey/data to support your position. We would want to hear more about it. 27 | 28 | At last, you've found the avenue to air these issues in an academic platform at the NLP-OSS workshop!!! 29 | 30 | Sharing your experiences, suggestions and analysis from/of NLP-OSS 31 | 32 | 33 | P/S: 34 | 35 | 1st CALL FOR PAPERS 36 | ==== 37 | 38 | ---------------------------------------------------------------- 39 | 40 | *Workshop for NLP Open Source Software (NLP-OSS)* 41 | 42 | 06 Dec 2023, Co-located with EMNLP 2023 43 | 44 | https://nlposs.github.io/ 45 | 46 | Deadline for Long and Short Paper submission: 09 August, 2023 47 | (23:59, GMT-11) 48 | 49 | ---------------------------------------------------------------- 50 | 51 | The Third Workshop for NLP Open Source Software (NLP-OSS) will be co-located 52 | with EMNLP 2023 on 06 Dec 2023. 53 | 54 | Focusing more on the social and engineering aspect of NLP software 55 | and less on scientific novelty or state-of-art models, the Workshop for NLP-OSS 56 | is an academic forum to advance open source developments for NLP research, 57 | teaching and application. 58 | 59 | NLP-OSS also provides an academic workshop to announce new software/features, 60 | promote the collaborative culture and best practices that go beyond the conferences. 61 | 62 | We invite full papers (8 pages) or short papers (4 pages) on topics related to 63 | NLP-OSS broadly categorized into (i) software development, (ii) scientific 64 | contribution and (iii) NLP-OSS case studies. 65 | 66 | - **Software Development** 67 | - Designing and developing NLP-OSS 68 | - Licensing issues in NLP-OSS 69 | - Backwards compatibility and stale code in NLP-OSS 70 | - Growing, maintaining and motivating an NLP-OSS community 71 | - Best practices for NLP-OSS documentation and testing 72 | - Contribution to NLP-OSS without coding 73 | - Incentivizing OSS contributions in NLP 74 | - Commercialization and Intellectual Property of NLP-OSS 75 | - Defining and managing NLP-OSS project scope 76 | - Issues in API design for NLP 77 | - NLP-OSS software interoperability 78 | - Analysis of the NLP-OSS community 79 | 80 | - **Scientific Contribution** 81 | - Surveying OSS for specific NLP task(s) 82 | - Demonstration, introductions and/or tutorial of NLP-OSS 83 | - Small but useful NLP-OSS 84 | - NLP components in ML OSS 85 | - Citations and references for NLP-OSS 86 | - OSS and experiment replicability 87 | - Gaps between existing NLP-OSS 88 | - Task-generic vs task-specific software 89 | 90 | 91 | - **Case studies** 92 | - Case studies of how a specific bug is fixed or feature is added 93 | - Writing wrappers for other NLP-OSS 94 | - Writing open-source APIs for open data 95 | - Teaching NLP with OSS 96 | - NLP-OSS in the industry 97 | 98 | Submission should be formatted according to the [EMNLP 2023 templates](https://2023.emnlp.org/call-for-papers) and submitted to [OpenReview](https://openreview.net/group?id=EMNLP/2023/Workshop/NLP-OSS) 99 | 100 | 101 | ORGANIZERS 102 | 103 | Geeticka Chauhan, Massachusetts Institute of Technology 104 | Dmitrijs Milajevs, Grayscale AI 105 | Elijah Rippeth, University of Maryland 106 | Jeremy Gwinnup, Air Force Research Laboratory 107 | Liling Tan, Amazon 108 | -------------------------------------------------------------------------------- /CFP-Tweets.md: -------------------------------------------------------------------------------- 1 | These CFP tweets can be used with scheduled tweets feature, when promoting the CFP. 2 | 3 | ---- 4 | 5 | You see this shiny *GPT model, but it doesn't work on your data. You did some magic to the code and now it works. Show us how you did it! Small tweaks, well-motivated and empirically tested are valid submissions to NLP-OSS 6 | 7 | https://nlposs.github.io/2023/index.html#call-for-papers 09 Aug 2023 8 | 9 | #nlproc #nlposs 10 | 11 | ---- 12 | 13 | You have tried 101 NLP tools and there's none that really do what you want. So you wrote your own shiny new package and made it open source. Tell the world about it at NLP-OSS! 14 | 15 | https://nlposs.github.io/2023/index.html#call-for-papers 09 Aug 2023 16 | 17 | #nlproc #nlposs 18 | 19 | ---- 20 | 21 | Managed to get hundreds, thousands or millions of user to your #nlproc open source tool? How did you design the code? Did users' feedback help you improve your library? Share your knowledge at NLP-OSS! 22 | 23 | https://nlposs.github.io/2023/index.html#call-for-papers 09 Aug 2023 24 | 25 | #nlproc #nlposs 26 | 27 | ---- 28 | 29 | You have tried the latest LLM models, bore grievances but found the solution after hours of coffee. Share at NLP-OSS and suggest how open source could change for the better (e.g. best practices, documentation, API design) 30 | 31 | https://nlposs.github.io/2023/index.html#call-for-papers 09 Aug 2023 32 | 33 | #nlproc #nlposs 34 | 35 | ----- 36 | 37 | 38 | You came across an awesome SOTA library on NLP task X that no LLM has beaten its F1 score. But now it takes a dinosaur to understand the stale code. Share your experience at NLP-OSS and propose how to "replicate" the library. 39 | 40 | https://nlposs.github.io/2023/index.html#call-for-papers 09 Aug 2023 41 | 42 | #nlproc #nlposs 43 | 44 | ---- 45 | 46 | Last day to submit to NLP Open Source Software workshop!! Have a blogpost or jupyter notebook to share in the last minute? Format it as an ACL paper like https://aclanthology.org/W18-2509/ =) 47 | 48 | https://nlposs.github.io/2023/index.html#call-for-papers 09 Aug 2023 49 | 50 | #nlproc #nlposs 51 | 52 | ---- 53 | -------------------------------------------------------------------------------- /Figure-Out-OpenReview.md: -------------------------------------------------------------------------------- 1 | Adding Reviewers to Pool 2 | ==== 3 | 4 | From https://openreview.net/group/edit?id=EMNLP/2023/Workshop/NLP-OSS/Reviewers, somehow I only found a way of manually copy+pasting the list from our PC usernames to comma-separated format then clicking `Add to Group` in batches. Unlike softconf, there's no way to add a csv or a one line per user file to the system. 5 | 6 | Then after adding the reviewers there, we can start using the `Select All` and `Message Selected` in the same https://openreview.net/group/edit?id=EMNLP/2023/Workshop/NLP-OSS/Reviewers to communicate with the PC. 7 | 8 | 9 | I can't figure out how to formally invite them as described on: 10 | 11 | - https://docs.openreview.net/getting-started/hosting-a-venue-on-openreview/navigating-your-venue-pages 12 | - https://docs.openreview.net/how-to-guides/managing-groups/how-to-recruit-and-remind-recruited-reviewers 13 | 14 | Whenever I go to https://openreview.net/group/edit?id=EMNLP/2023/Workshop/NLP-OSS/Reviewers/Invited page, it just throws an error: 15 | 16 | > Error 404 17 | > The server responded with the following message: 18 | > Group Not Found: EMNLP/2023/Workshop/NLP-OSS/Reviewers/Invited 19 | > If you'd like to report this error to the developers, please use the Feedback form. 20 | 21 | 22 | Most probably the right way to do it is from https://openreview.net/group?id=EMNLP/2023/Workshop/NLP-OSS/Program_Chairs#venue-configuration and then click on `Full Venue Configuration` then click on `Recruitment`, then the PC invitation form will appear for "New Recruitment" 23 | 24 | 25 | ---- 26 | 27 | Modify Review Form 28 | ==== 29 | 30 | This is supposedly the official docs https://docs.openreview.net/reference/default-forms/default-review-form 31 | 32 | To access the review, first, we start from 33 | 34 | - from https://openreview.net/group?id=EMNLP/2023/Workshop/NLP-OSS/Program_Chairs#venue-configuration 35 | - click on `Full Venue Configuration`, 36 | - then after it gets to the conference/workshop configuration page, click on `Review Stage` 37 | - which will open up the "New Review Stage" page, on there, 38 | - set the `Review Start Date` to the conference/workshop paper submission deadline 39 | - set the `Review Deadline` for the official review deadline from the program committee 40 | - set the `Review Expiration Date` (usually 1-2 days before the notification of the acceptance/rejection results) 41 | - change the `Review Rating Field Name` 42 | - change the `Review Confidence Field Name` 43 | - edit the `Additional Review Form Options` with a JSON, currently using: 44 | 45 | ``` 46 | { 47 | "review": { 48 | "value": { 49 | "param": { 50 | "type": "string", 51 | "minLength": 1, 52 | "maxLength": 20000, 53 | "input": "textarea", 54 | "markdown": true 55 | } 56 | }, 57 | "order": 1, 58 | "description": "What is this paper about and what contributions does it make? Please describe what problem or question this paper addresses, and the main contributions that it makes towards a solution or answer. The following kinds of contributions listed on our CFP are all welcomed https://nlposs.github.io/2023/index.html#call-for-papers" 59 | }, 60 | "appropriateness": { 61 | "value": { 62 | "param": { 63 | "type": "string", 64 | "enum": [ 65 | "5: Certainly", 66 | "4: Probably", 67 | "3: Unsure", 68 | "2: Probably not", 69 | "1: Certainly not" 70 | ], 71 | "input": "radio" 72 | } 73 | }, 74 | "description": "Does this paper fit in the NLP-OSS workshop? Most papers should be in the 'certainly' 5 category. We accept various contributions ranging from empirical results, system demonstrations, case studies and theoretical / position discussions results are welcome.", 75 | "order": 2 76 | }, 77 | "clarity": { 78 | "value": { 79 | "param": { 80 | "type": "string", 81 | "enum": [ 82 | "5: Very clear", 83 | "4: Understandable by most readers", 84 | "3: Mostly understandable to me with some effort", 85 | "2: Important questions were hard to resolve even with effort", 86 | "1: Much of the paper is confusing" 87 | ], 88 | "input": "radio" 89 | } 90 | 91 | }, 92 | "description": "Is the discussion/experiments/tool was clearly described in the paper? For the reasonably well-prepared reader, is it clear what was done and why? Is the paper well-written and well-structured?", 93 | "order": 3 94 | }, 95 | "soundness": { 96 | "value": { 97 | "param": { 98 | "type": "string", 99 | "enum": [ 100 | "5: Excellent: This study is one of the most thorough I have seen, given its type.", 101 | "4: Strong: This study provides sufficient support for all of its claims/arguments. ", 102 | "3: Good: This study provides sufficient support for its major claims/arguments, some minor points may need extra support or details.", 103 | "2: Borderline: Some of the main claims/arguments are not sufficiently supported, there are major technical/methodological problems", 104 | "1: Poor: This study is not yet sufficiently thorough to warrant publication or is not relevant to NLP-OSS." 105 | ], 106 | "input": "radio" 107 | } 108 | 109 | }, 110 | "description": "How sound and thorough is this study? First, is the technical approach sound and well-chosen? Second, can one trust the claims of the paper -- are they supported by proper experiments, proofs, or other argumentation?", 111 | "order": 4 112 | }, 113 | "recommendation": { 114 | "value": { 115 | "param": { 116 | "type": "string", 117 | "enum": [ 118 | "5: Exciting: I'd fight to get it accepted", 119 | "4: Worthy: I would like to see it accepted", 120 | "3: Borderline: I'm ambivalent about this one", 121 | "2: Mediocre: I'd rather not see it in the conference", 122 | "1: Poor: I'd fight to have it rejected" 123 | ], 124 | "input": "radio" 125 | } 126 | }, 127 | "description": "Should the paper be accepted or rejected? There are many good submissions competing for slots at this event; how important is it to feature this one? Will people learn a lot by reading this paper or seeing it presented? In deciding on your ultimate recommendation, please think over all your scores above. But remember that no paper is perfect, and remember that we want a conference full of interesting, diverse, and timely work. If a paper has some weaknesses, but you really got a lot out of it, feel free to fight for it. If a paper is solid but you could live without it, let us know that you're ambivalent. Remember also that the author has a couple of weeks to address reviewer comments before the camera-ready deadline. ", 128 | "order": 5 129 | }, 130 | "open_source": { 131 | "value": { 132 | "param": { 133 | "type": "string", 134 | "enum": [ 135 | "Yes: Work done is open sourced or discussion describing open source related issues", 136 | "No: Work done is mainly describing unknown licensed tools / not free nor open software", 137 | "Unclear: Work done has not specified whether the tools demonstrated or described is open source" 138 | ], 139 | "input": "radio" 140 | } 141 | 142 | }, 143 | "description": "We want to make sure that NLP-OSS remains a venue to discuss open source tools and related issues. If the paper (i) has described experiments, features or issues that is explicitly non-open or not free source without any comparison to open source counter parts, or (ii) has described or demonstrated a tool with unclear licensing, please flag them out in your answer. We want to discourage (i) submissions and re-direct them to other appropriate venues and hopefully shepherd (ii) authors towards a clear licensing decision.", 144 | "order": 6 145 | }, 146 | "confidence": { 147 | "value": { 148 | "param": { 149 | "type": "string", 150 | "enum": [ 151 | "5: Positive that my evaluation is correct. I read the paper very carefully and I am very familiar with related work.", 152 | "4: Quite sure. I tried to check the important points carefully. It's unlikely, though conceivable, that I missed something that should affect my ratings.", 153 | "3: Pretty sure, but there's a chance I missed something. Although I have a good feel for this area in general, I did not carefully check the paper's details, e.g., the math, experimental design, or novelty.", 154 | "2: Willing to defend my evaluation, but it is fairly likely that I missed some details, didn't understand some central points, or can't be sure about the novelty of the work.", 155 | "1: Not my area, or paper was hard for me to understand. My evaluation is just an educated guess." 156 | ], 157 | "input": "radio" 158 | } 159 | 160 | }, 161 | "description": "How confident are you in your assessment of this paper?", 162 | "order": 7 163 | } 164 | } 165 | ``` 166 | 167 | 168 | ---- 169 | 170 | Assigning papers to Reviewers 171 | ==== 172 | 173 | From https://openreview.net/group?id=EMNLP/2023/Workshop/NLP-OSS/Program_Chairs#venue-configuration, there's a "Reviewers Paper Assignment" link but whenever I clicked on it, it just throws the error: 174 | 175 | > Error 404 176 | > The server responded with the following message: 177 | > There is currently no assignment configuration ready for use. Please go to your venue request form and use the Paper Matching Setup to compute conflicts and/or affinity scores. 178 | 179 | After some figuring out, the paper matching setup failed the first time but the second time the button never re-appeared. 180 | 181 | And the reviews won't start on the system until the submission deadline closes. A work-around it is after the official deadline, change the time of the end of submission to yesterday. Then click on the review assignment on 182 | 183 | - https://openreview.net/group?id=EMNLP/2023/Workshop/NLP-OSS/Program_Chairs#venue-configuration 184 | - then click on "Reviewers Paper Assignment" 185 | - then the assignment system appears, 186 | - then go back to https://openreview.net/group?id=EMNLP/2023/Workshop/NLP-OSS/Program_Chairs#venue-configuration 187 | - click on "full venue configuration", then in the configuration page, click on "Revision" 188 | - change the time back to extended deadline 189 | -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- 1 | Creative Commons Legal Code 2 | 3 | CC0 1.0 Universal 4 | 5 | CREATIVE COMMONS CORPORATION IS NOT A LAW FIRM AND DOES NOT PROVIDE 6 | LEGAL SERVICES. DISTRIBUTION OF THIS DOCUMENT DOES NOT CREATE AN 7 | ATTORNEY-CLIENT RELATIONSHIP. CREATIVE COMMONS PROVIDES THIS 8 | INFORMATION ON AN "AS-IS" BASIS. CREATIVE COMMONS MAKES NO WARRANTIES 9 | REGARDING THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS 10 | PROVIDED HEREUNDER, AND DISCLAIMS LIABILITY FOR DAMAGES RESULTING FROM 11 | THE USE OF THIS DOCUMENT OR THE INFORMATION OR WORKS PROVIDED 12 | HEREUNDER. 13 | 14 | Statement of Purpose 15 | 16 | The laws of most jurisdictions throughout the world automatically confer 17 | exclusive Copyright and Related Rights (defined below) upon the creator 18 | and subsequent owner(s) (each and all, an "owner") of an original work of 19 | authorship and/or a database (each, a "Work"). 20 | 21 | Certain owners wish to permanently relinquish those rights to a Work for 22 | the purpose of contributing to a commons of creative, cultural and 23 | scientific works ("Commons") that the public can reliably and without fear 24 | of later claims of infringement build upon, modify, incorporate in other 25 | works, reuse and redistribute as freely as possible in any form whatsoever 26 | and for any purposes, including without limitation commercial purposes. 27 | These owners may contribute to the Commons to promote the ideal of a free 28 | culture and the further production of creative, cultural and scientific 29 | works, or to gain reputation or greater distribution for their Work in 30 | part through the use and efforts of others. 31 | 32 | For these and/or other purposes and motivations, and without any 33 | expectation of additional consideration or compensation, the person 34 | associating CC0 with a Work (the "Affirmer"), to the extent that he or she 35 | is an owner of Copyright and Related Rights in the Work, voluntarily 36 | elects to apply CC0 to the Work and publicly distribute the Work under its 37 | terms, with knowledge of his or her Copyright and Related Rights in the 38 | Work and the meaning and intended legal effect of CC0 on those rights. 39 | 40 | 1. Copyright and Related Rights. A Work made available under CC0 may be 41 | protected by copyright and related or neighboring rights ("Copyright and 42 | Related Rights"). Copyright and Related Rights include, but are not 43 | limited to, the following: 44 | 45 | i. the right to reproduce, adapt, distribute, perform, display, 46 | communicate, and translate a Work; 47 | ii. moral rights retained by the original author(s) and/or performer(s); 48 | iii. publicity and privacy rights pertaining to a person's image or 49 | likeness depicted in a Work; 50 | iv. rights protecting against unfair competition in regards to a Work, 51 | subject to the limitations in paragraph 4(a), below; 52 | v. rights protecting the extraction, dissemination, use and reuse of data 53 | in a Work; 54 | vi. database rights (such as those arising under Directive 96/9/EC of the 55 | European Parliament and of the Council of 11 March 1996 on the legal 56 | protection of databases, and under any national implementation 57 | thereof, including any amended or successor version of such 58 | directive); and 59 | vii. other similar, equivalent or corresponding rights throughout the 60 | world based on applicable law or treaty, and any national 61 | implementations thereof. 62 | 63 | 2. Waiver. To the greatest extent permitted by, but not in contravention 64 | of, applicable law, Affirmer hereby overtly, fully, permanently, 65 | irrevocably and unconditionally waives, abandons, and surrenders all of 66 | Affirmer's Copyright and Related Rights and associated claims and causes 67 | of action, whether now known or unknown (including existing as well as 68 | future claims and causes of action), in the Work (i) in all territories 69 | worldwide, (ii) for the maximum duration provided by applicable law or 70 | treaty (including future time extensions), (iii) in any current or future 71 | medium and for any number of copies, and (iv) for any purpose whatsoever, 72 | including without limitation commercial, advertising or promotional 73 | purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each 74 | member of the public at large and to the detriment of Affirmer's heirs and 75 | successors, fully intending that such Waiver shall not be subject to 76 | revocation, rescission, cancellation, termination, or any other legal or 77 | equitable action to disrupt the quiet enjoyment of the Work by the public 78 | as contemplated by Affirmer's express Statement of Purpose. 79 | 80 | 3. Public License Fallback. Should any part of the Waiver for any reason 81 | be judged legally invalid or ineffective under applicable law, then the 82 | Waiver shall be preserved to the maximum extent permitted taking into 83 | account Affirmer's express Statement of Purpose. In addition, to the 84 | extent the Waiver is so judged Affirmer hereby grants to each affected 85 | person a royalty-free, non transferable, non sublicensable, non exclusive, 86 | irrevocable and unconditional license to exercise Affirmer's Copyright and 87 | Related Rights in the Work (i) in all territories worldwide, (ii) for the 88 | maximum duration provided by applicable law or treaty (including future 89 | time extensions), (iii) in any current or future medium and for any number 90 | of copies, and (iv) for any purpose whatsoever, including without 91 | limitation commercial, advertising or promotional purposes (the 92 | "License"). The License shall be deemed effective as of the date CC0 was 93 | applied by Affirmer to the Work. Should any part of the License for any 94 | reason be judged legally invalid or ineffective under applicable law, such 95 | partial invalidity or ineffectiveness shall not invalidate the remainder 96 | of the License, and in such case Affirmer hereby affirms that he or she 97 | will not (i) exercise any of his or her remaining Copyright and Related 98 | Rights in the Work or (ii) assert any associated claims and causes of 99 | action with respect to the Work, in either case contrary to Affirmer's 100 | express Statement of Purpose. 101 | 102 | 4. Limitations and Disclaimers. 103 | 104 | a. No trademark or patent rights held by Affirmer are waived, abandoned, 105 | surrendered, licensed or otherwise affected by this document. 106 | b. Affirmer offers the Work as-is and makes no representations or 107 | warranties of any kind concerning the Work, express, implied, 108 | statutory or otherwise, including without limitation warranties of 109 | title, merchantability, fitness for a particular purpose, non 110 | infringement, or the absence of latent or other defects, accuracy, or 111 | the present or absence of errors, whether or not discoverable, all to 112 | the greatest extent permissible under applicable law. 113 | c. Affirmer disclaims responsibility for clearing rights of other persons 114 | that may apply to the Work or any use thereof, including without 115 | limitation any person's Copyright and Related Rights in the Work. 116 | Further, Affirmer disclaims responsibility for obtaining any necessary 117 | consents, permissions or other rights required for any use of the 118 | Work. 119 | d. Affirmer understands and acknowledges that Creative Commons is not a 120 | party to this document and has no duty or obligation with respect to 121 | this CC0 or use of the Work. 122 | -------------------------------------------------------------------------------- /PC-Invite-Template.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | We would like to invite you to join the Program Committee for the Workshop for NLP Open Source Software (NLP-OSS) in 2023. 4 | 5 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. 6 | 7 | More details about the previous editions of the workshop workshop can be found on https://nlposs.github.io/2020/index.html 8 | 9 | Your experience with _____ and various NLP OSS is very valuable to the workshop that could be shared with the open source NLP/MT communities. 10 | 11 | The committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. As a committee member, you can help us to review paper and promote the workshop by encouraging submissions and circulating information. Additionally, we have chosen to openly write the workshop proposal and website; we welcome contributions from you and anyone in the open community https://github.com/nlposs/NLP-OSS. 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | EACL | Dubrovnik, Croatia | 2-6 May 2023 | 18 | | ACL | Toronto, Canada | 9-14 July 2023 | 19 | | EMNLP | Singapore | TBD (after July 2023) | 20 | 21 | We would be grateful if you could confirm your participation in the committee by replying to this email or post a comment on . We hope that you confirm your participation soon. 22 | 23 | If you have other PC candidates that you would like to recommend for the workshop, please do contact us. 24 | 25 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 26 | 27 | Best Regards, 28 | NLP-OSS Organizers 29 | -------------------------------------------------------------------------------- /PC-Preworkshop-Email-NoReplyFollowUp.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | Thank you again for accepting the invitation to Workshop for NLP-OSS 2023 Program Committee! We will be co-located with EMNLP this year on the sunny island of Singapore. 4 | 5 | We have sent you a request to fill out the pre-workshop survey but we haven't heard from you since our invitation. **Please kindly spare a few minutes to fill out this form https://forms.gle/MDKCoD1x44YXALb78 so that we get your name, contact and affiliations right.** 6 | 7 | Thank you in advance for your time in filling up the survey and your support for the NLP-OSS workshop! 8 | 9 | We will be sending out social media and mailing list posts to announce the 2nd CFP in a couple of days. We hope you can help share with your peers, students, and contacts interested in NLP and OSS. We look forward to a fruitful workshop ahead! 10 | 11 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 12 | 13 | Best Regards, 14 | NLP-OSS Organizers 15 | 16 | PS: If you have received this email but haven't seen your name on the https://nlposs.github.io/2023/index.html#programme-committee, we might have either made an error copy+pasting on the site or you might have yet replied to a previous invitation mail we sent you last fall/winter 2022. Please do fill out the Google form if you're interested and we'll add you to the list. https://forms.gle/MDKCoD1x44YXALb78 17 | 18 | If you have received this email but have previously informed us that you'll skip being PC on this year's workshop, please do let us know, our contact list might have yet updated your interest; if you've changed your mind and is available and would like to join us, please fill out the Google form too. https://forms.gle/MDKCoD1x44YXALb78 19 | -------------------------------------------------------------------------------- /PC-Preworkshop-Email.md: -------------------------------------------------------------------------------- 1 | 2 | Dear ____, 3 | 4 | Thank you again for accepting the invitation to Workshop for NLP-OSS 2023 Program Committee. We will be co-located with EMNLP this year on the sunny island of Singapore. 5 | **Please kindly spare a few minutes to fill out this form https://forms.gle/MDKCoD1x44YXALb78 so that we get your name, contact and affiliations right.** 6 | 7 | Below are the important dates to note for the conference. 8 | And we would like to highlight the paper review period from **25 August to 01 October**. 9 | 10 | | Important Event | Date | 11 | |:-|:-| 12 | | Paper submission: | 09 August, 2023 | 13 | | Paper Reviews Starts: | 25 August 2023 | 14 | | Paper Reviews Due: | 01 October 2023 | 15 | | Notification of Acceptance: | 10 October 2023 | 16 | | Camera-Ready Version: | 25 October 2023 | 17 | | Workshop: | 6 December 2023 | 18 | 19 | We will be sending out social media and mailing list posts to announce the first CFP in a couple of days. We hope you can help share with your peers, students, and contacts interested in NLP and OSS. We look forward to a fruitful workshop ahead! 20 | 21 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 22 | 23 | Best Regards, 24 | NLP-OSS Organizers 25 | 26 | PS: If you have received this email but haven't seen your name on the https://nlposs.github.io/2023/index.html#programme-committee, we might have either made an error copy+pasting on the site or you might have yet replied to a previous invitation mail we sent you last fall/winter 2022. Please do fill out the Google form if you're interested and we'll add you to the list. https://forms.gle/MDKCoD1x44YXALb78 27 | 28 | If you have received this email but have previously informed us that you'll skip being PC on this year's workshop, please do let us know, our contact list might have yet updated your interest; if you've changed your mind and is available and would like to join us, please fill out the Google form too. https://forms.gle/MDKCoD1x44YXALb78 29 | -------------------------------------------------------------------------------- /PC-Reinvite-Template.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | Thank you again for being part of the Program Committee in the previous Workshops for NLP Open Source Software (NLP-OSS) in 2018/2020. **The workshop would not have been possible without your contribution!** 4 | 5 | We would like to invite you to join the Program Committee for the Workshop for NLP Open Source Software (NLP-OSS) in 2023. 6 | 7 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. 8 | 9 | Your experience with various NLP OSS is very valuable to the workshop that could be shared with the open source NLP/MT communities. 10 | 11 | The committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. As a committee member, you can help us to review paper and promote the workshop by encouraging submissions and circulating information. Additionally, we have chosen to openly write the workshop proposal and website; we welcome contributions from you and anyone in the open community https://github.com/nlposs/NLP-OSS/blob/master/Proposal-2023.md 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | EACL | Dubrovnik, Croatia | 2-6 May 2023 | 18 | | ACL | Toronto, Canada | 9-14 July 2023 | 19 | | EMNLP | Singapore | TBD (after July 2023) | 20 | 21 | **We would be grateful if you could confirm your participation in the committee by replying to this email or post a comment on https://github.com/nlposs/NLP-OSS/issues/35. We hope that you confirm your participation soon.** 22 | 23 | If you have other PC candidates that you would like to recommend for the workshop, please do contact us. 24 | 25 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 26 | 27 | Best Regards, 28 | NLP-OSS Organizers 29 | -------------------------------------------------------------------------------- /Proposal-2023.md: -------------------------------------------------------------------------------- 1 | 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS) 2 | ==== 3 | 4 | With great scientific breakthrough comes solid engineering and open communities. The Natural Language Processing (NLP) community has benefited greatly from the open culture in sharing knowledge, data, and software. The primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing, and maintaining NLP open source software (OSS), which we seldom talk about in scientific publications. Our secondary goal is to promote synergies between different open source projects and encourage cross-software collaborations and comparisons. 5 | 6 | We refer to Natural Language Processing OSS as an umbrella term that not only covers traditional syntactic, semantic, phonetic, and pragmatic applications; we extend the definition to include task-specific applications (e.g., machine translation, information retrieval, question-answering systems), low-level string processing that contains valid linguistic information (e.g. Unicode creation for new languages, language-based character set definitions) and machine learning/artificial intelligence frameworks with functionalities focusing on text applications. 7 | 8 | There are many workshops focusing on the creation and curation of open language resources and annotations (e.g. BUCC, GWN, LAW, LOD, WAC). Moreover, we have the flagship LREC conference dedicated to linguistic resources. However, the engineering aspects of NLP-OSS are overlooked and under-discussed within the community. There are open source conferences and venues (such as FOSDEM, OSCON, Open Source Summit) where discussions range from operating system kernels to air traffic control hardware but the representation of NLP related presentations is limited. In the Machine Learning (ML) field, the Journal of Machine Learning Research - Machine Learning Open Source Software (JMLR-MLOSS) is a forum for discussions and dissemination of ML OSS topics. We envision that the Workshop for NLP-OSS becomes a similar avenue for NLP-OSS discussions. 9 | 10 | A decade ago, there was also the SETQA-NLP (Software Engineering, Testing, and Quality Assurance for Natural Language Processing) workshop that raised awareness of the need for good software engineering practices in NLP. In the earlier days of NLP, linguistic software was often monolithic and the learning curve to install, use, and extend the tools was steep and frustrating. More often than not, NLP-OSS developers/users interact in siloed communities within the ecologies of their respective projects. In addition to engineering aspects of NLP software, the open source movement has brought a community aspect that we often overlook in building impactful NLP technologies. 11 | 12 | More recently there have been successful workshops which examine and promote [open science](https://bigscience.huggingface.co/acl-2022) in NLP. While important and complementary, the goals of these workshops are distinct from those of NLP-OSS which focuses more on the community of practice in open-source software in support of NLP and language technologies. We expect many who participated in the BigScience workshop to participate in NLP-OSS as many of the participants are former PC members in past editions of NLP-OSS. Another grassroot community movement, [Eleuther AI](https://www.eleuther.ai/faq/#general) started with the researchers attempting to replicate commercial language models and has since grown to an active decentralized community of volunteer researchers, engineers, and developers focused on AI alignment, scaling, and open source AI research. 13 | 14 | 22 | 23 | With the rise of open source startups like Huggingface, the democratization of NLP gives researchers and the general public easy access to language models once available only to a handful of industrial research labs. This acceleration of NLP tools availability creates new synergies between cloud integrations, e.g. [Huggingface x AWS Sagemaker](https://huggingface.co/docs/sagemaker/index), that allows engineers and researchers to train and deploy live applications with minimal infrastructure setups. Building on the shoulders of giants, the scikit-learn and Huggingface ecosystems are now interoperable under the [`skops` framework](https://blog.scikit-learn.org/updates/community/joining-forces-hugging-face/). 24 | 25 | We want to highlight these emergent communities and synergies in the NLP-OSS workshop and promote future collaborations with like-minded open source NLP researchers in the third NLP-OSS workshop. The first and second NLP-OSS workshop, which was co-located with ACL 2018, was the first workshop in recent years that focused more on building quality software for NLP, open sourcing, developing useful engineering practices, and less on scientific novelty or state-of-art development. We hope that the 3rd NLP-OSS workshop could also be hosted in an *ACL conference, to be an intellectual forum to collate this type of knowledge, announce new software/features, promote the open source culture and best practices that go beyond the conferences. 26 | 27 | 28 | ## Call for Papers 29 | 30 | We invite full papers (8 pages) or short papers (4 pages) on topics related to NLP-OSS broadly categorized into (i) software development, (ii) scientific contribution and (iii) NLP-OSS case studies. 31 | 32 | - **Software Development** 33 | - Designing and developing NLP-OSS 34 | - Licensing issues in NLP-OSS 35 | - Backwards compatibility and stale code in NLP-OSS 36 | - Growing, maintaining and motivating an NLP-OSS community 37 | - Best practices for NLP-OSS documentation and testing 38 | - Contribution to NLP-OSS without coding 39 | - Incentivizing OSS contributions in NLP 40 | - Commercialization and Intellectual Property of NLP-OSS 41 | - Defining and managing NLP-OSS project scope 42 | - Issues in API design for NLP 43 | - NLP-OSS software interoperability 44 | - Analysis of the NLP-OSS community 45 | 46 | - **Scientific Contribution** 47 | - Surveying OSS for specific NLP task(s) 48 | - Demonstration, introductions and/or tutorial of NLP-OSS 49 | - Small but useful NLP-OSS 50 | - NLP components in ML OSS 51 | - Citations and references for NLP-OSS 52 | - OSS and experiment replicability 53 | - Gaps between existing NLP-OSS 54 | - Task-generic vs task-specific software 55 | 56 | 57 | - **Case studies** 58 | - Case studies of how a specific bug is fixed or feature is added 59 | - Writing wrappers for other NLP-OSS 60 | - Writing open-source APIs for open data 61 | - Teaching NLP with OSS 62 | - NLP-OSS in the industry 63 | 64 | 65 | 66 | 67 | ## Demographic Diversity 68 | 69 | **Organizers:** We have 5 organizers with representation from industrial NLP/ML labs, government organization and academic institutes. 70 | 71 | **PC members:** We strive to a have a balance of academic and industrial PC from diverse gender and geolocation demographics. We extended our list of PC members in NLP-OSS 2018 edition by inviting a subset of the WiNLP members on the BIG directory and accepted invitees have since joined us in NLP-OSS 2020 and reinvited in the proposed 2023 edition. 72 | 73 | 74 | 75 | 76 | ## Misc 77 | 78 | **Estimated no. of Attendees**: 50 79 | 80 | **Shared Task**: No 81 | 82 | **Special Requirements / Technical Needs**: No 83 | 84 | **Preferred Venue**: 85 | 86 | 1. EMNLP 87 | 2. ACL 88 | 3. EACL 89 | 90 | 91 | **Previous Workshop**: 92 | 93 | - [First Workshop for NLP-OSS at ACL 2018](https://nlposs.github.io) 94 | - 10 submissions, 9 accepted publications 95 | - 30+ attendees 96 | - 3 invited speakers 97 | 98 | - [Second Workshop for NLP-OSS at ACL 2020](https://nlposs.github.io) 99 | - 32 submissions, 21 accepted publications 100 | - 40+ attendees 101 | - 3 invited speakers 102 | 103 | **Expected no. of submissions**: 30-40 submissions 104 | 105 | ## Organizers 106 | 107 | - [Geeticka Chauhan](https://www.csail.mit.edu/person/geeticka-chauhan), Massachusetts Institute of Technology 108 | 109 | Geeticka Chauhan is a Ph.D. student at MIT, working on NLP for healthcare advised by Prof. Peter Szolovits. Her master thesis focused on revealing the reproducibility and generalizability problems in Relation Extraction, and experimentally showed the importance of streamlining evaluation methods in NLP challenges 110 | 111 | - [Dmitrijs Milajevs](http://zest.id.lv/), Grayscale AI. 112 | 113 | Dmitrijs Milajevs is a data scientist at KMPG. Previously, he evaluated information retrieval systems at National Institute of Standards and Technology (NIST). He has defended a Ph.D. thesis on evaluation of compositional models in distributional semantics. 114 | 115 | - [Elijah Rippeth](https://erip.github.io/), University of Maryland. 116 | 117 | Elijah Rippeth is Ph.D. student at the University of Maryland in the Department of Computer Science. His work focuses broadly on natural language processing, but with a focus on multilingual NLP, cross-lingual transfer, and machine translation. 118 | 119 | - Jeremy Gwinnup, Air Force Research Laboratory. 120 | 121 | Jeremy Gwinnup is a Research Computer Scientist in the Airman Systems Directorate of the Air Force Research Laboratory located in Dayton, Ohio USA. His research focuses on multimodal machine translation and is the topic of his studies as a Doctor of Engineering student at Johns Hopkins University. 122 | 123 | 124 | - [Liling Tan](https://github.com/alvations), Rakuten Institute of Technology 125 | 126 | Liling is a research scientist at Rakuten Institute of Technology working on Machine Translation and developing applications using language technologies. He has been actively involved in corpora creation/maintenance, Asian NLP and machine translation. He co-organized the [Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2014-16)](http://ttg.uni-saarland.de/vardial2016/). 127 | 128 | 129 | ## Programme Committee (Confirmed from pre-proposal survey) 130 | 131 | - Aakanksha Naik, Allen Institute for Artificial Intelligence 132 | - Aitor Soroa, HiTZ Center - Ixa, University of the Basque Country UPV/EHU 133 | - Alexander Rush, Cornell, Hugging Face 134 | - Aline Paes, Universidade Federal Fluminense 135 | - Amittai Axelrod, Apple AI 136 | - Anish Mohan, Nvidia 137 | - Arun Balajiee Lekshmi Narayanan, University of Pittsburgh 138 | - Atnafu lambebo Tonja, Instituto Politécnico Nacional 139 | - Atul Kr. Ojha, University of Galway 140 | - Cassandra Jacobs, University at Buffalo 141 | - Christoph Teichmann, Bloomberg LP 142 | - Daniel Braun, University of Twente 143 | - Dave Howcroft, Edinburgh Napier University 144 | - Diana Maynard, University of Sheffield 145 | - Flammie a Pirinen, University of Norway 146 | - Gérard Dupont, Mavenoid 147 | - Jack Morris, Cornell University 148 | - Jörg Tiedemann, University of Helsinki 149 | - Karin Sim, Language Weaver 150 | - Kevin Cohen, University of Colorado 151 | - Lane Schwartz, University of Alaska Fairbanks 152 | - Leo Boytsov, Amazon 153 | - Lucy Park, Upstage 154 | - Maarten van Gompel, Radboud University 155 | - Maheshwar Ghankot, Indira Gandhi National Open University 156 | - Mallika Singh, Harvard Medical School 157 | - Marco Cognetta, Tokyo Institute of Technology, Google 158 | - Marzieh Fadaee, Zeta Alpha Vector 159 | - Matt Post, Microsoft 160 | - Micah Shlain, Allen Institute for Artificial Intelligence 161 | - Michael Wayne Goodman, LivePerson, Inc. 162 | - Mohd Sanad Zaki Rizvi, University of Edinburgh 163 | - Nelson F. Liu, Stanford University, 164 | - Ogundepo Odunayo, University of Waterloo 165 | - Pasquale Lisena, EURECOM 166 | - Phu Mon Htut, AWS AI Labs 167 | - Raeid Saqur, Princeton University 168 | - Raphael Tang, Comcast Applied AI 169 | - Sagnik Ray Choudhury, University of Michigan 170 | - Shilpa Suresh, Harvard Medical School 171 | - Sina Ahmadi, George Mason University 172 | - Steve DeNeefe, RWS Language Weaver 173 | - Steven Bethard, University of Arizona 174 | - Taha Zerrouki, Bouira University Algeria 175 | - Tenzin Bhotia, Delhi Technological University 176 | - Thomas Kober, Zalando SE 177 | - Tomas Mikolov, Czech Institute of Informatics 178 | - Tommaso Teofili, Roma Tre University 179 | - Vlad Niculae, University of Amsterdam 180 | - Won Ik Cho, Seoul National University 181 | - Zaid Alyafeai, King Fahd University of Petroleum and Minerals 182 | - Ziv Litmanovitz, University of Haifa 183 | 184 | 185 | 186 | ## Programme Committee (Previous PC, To follow up with) 187 | 188 | - Abigail See 189 | - Akiko Eriguchi 190 | - Amandalynne Paullada 191 | - Anca Dumitrache 192 | - Andreas Mueller 193 | - Arwen Griffioen 194 | - Brendan O'Connor 195 | - Carolina Scarton 196 | - Chris Hokamp 197 | - Christian Federmann 198 | - Christopher Manning 199 | - Dan Simonson 200 | - David Przybilla 201 | - Delip Rao 202 | - Denny Britz 203 | - Ehsan Khoddam 204 | - Eleftherios Avramidis 205 | - Emiel van Miltenburg 206 | - Emily Dinan 207 | - Eva Maria Vecchi 208 | - Fabio Kepler 209 | - Francis Bond 210 | - Frédéric Blain 211 | - Graham Neubig 212 | - Grzegorz Chrupała 213 | - Hal Daumé III 214 | - Ian Soboroff 215 | - Ignatius Ezeani 216 | - James Bradbury 217 | - Jason Baldridge 218 | - Jiatao Gu 219 | - Joel Grus 220 | - Joel Nothman 221 | - Jon Dehdari 222 | - Karin Sim Smith 223 | - Kheng Hui Yeo 224 | - Kyunghyun Cho 225 | - Madison May 226 | - Marcel Bollmann 227 | - Marcos Zampieri 228 | - Mary Ellen Foster 229 | - Matthew Honnibal 230 | - Moshe Wasserblat 231 | - Muthu Kumar Chandrasekaran 232 | - Paul Pu Liang 233 | - Philipp Koehn 234 | - Pontus Stenetorp 235 | - Rachael Tatman 236 | - Radim Rehurek 237 | - Rico Sennrich 238 | - Sandya Mannarswamy 239 | - Sang Phan 240 | - Shamil Chollampatt 241 | - Sharat Chikkerur 242 | - Shubhanshu Mishra 243 | - Stephen Sloto 244 | - Svitlana Vakulenko 245 | - Taku Kudo 246 | - Tareq Al-Moslmi 247 | - Tilahun Abedissa Taffa 248 | - Varun Kumar 249 | - Vered Shwartz 250 | - Yusuke Miyao 251 | - Yves Peirsman 252 | 253 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | # Workshop for NLP Open Source Software (NLP-OSS) 2 | 3 | This is the Github page where we keep track of the developments of NLP-OSS workshop. 4 | 5 | The [ACL/COLING/EMNLP/NAACL proposal](https://acl2020.org/calls/workshops/) is at https://github.com/nlposs/NLP-OSS/blob/master/Proposal.md 6 | 7 | Let's use the issue tracker to do the following: 8 | 9 | - confirm the people in the program committee 10 | - track progress of the workshop website 11 | - catch suggestions/comments and grammar errors in the proposal (please create new issues, when appropriate) 12 | - the proposal is unpolished and everything is open to changes =) 13 | 14 | -------------------------------------------------------------------------------- /Reviewing-Period-Start.md: -------------------------------------------------------------------------------- 1 | Dear NLP-OSS Programme Committee members, 2 | 3 | Thank you again for accepting the invitation to Workshop for NLP-OSS 2023 Program Committee! We will be co-located with EMNLP this year on the sunny island of Singapore (06 Dec 2023). 4 | 5 | We are starting our reviewing period, we strive to keep a light reviewing load and give authors helpful feedback to improve the papers and suggestions for their OSS. The reviews are due a month from now (01 Oct 2023). And here's a timeline of the workshop: 6 | 7 | |Important Dates|| 8 | |:-|:-| 9 | |**Paper Reviews Starts:** |31 August 2023| 10 | |**Paper Reviews Due:** |01 October 2023| 11 | |**Notification of Acceptance:** |10 October 2023| 12 | |**Camera-Ready Version:** |25 October 2023| 13 | |**Workshop:** |06 Dec 2023| 14 | 15 | Thank you in advance for the time and effort to do the paper reviews and your support for the NLP-OSS workshop! With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 16 | 17 | Best Regards, 18 | NLP-OSS Organizers 19 | -------------------------------------------------------------------------------- /Softconf-Setup.md: -------------------------------------------------------------------------------- 1 | # Setup Submission Parameters 2 | 3 | Go to `Manager Console > Submission Setup > Setup Submission Parameters` 4 | 5 | 1. Set "Enforce maximum length" to 10 at "If yes, enter number of pages allowed:" 6 | 2. Increase "Number of words allowed" for "Accompanying Summary" to 800 chars 7 | 3. Allow "Additional Attachment" 8 | 4. Set "Give authors the possibility to withdraw their submissions?" to Yes 9 | 5. Click on "Save and Preview Basic Submission Page" to check. 10 | 11 | 12 | -------------------------------------------------------------------------------- /Speaker-Invite-1.md: -------------------------------------------------------------------------------- 1 | Dear Jordan, 2 | 3 | We are excited to invite you as our guest speaker for the 3rd Workshop for NLP Open Source Software (NLP-OSS) in year 2023, colocated with EMNLP at Singapore in 06 December. 4 | 5 | We get to know about Spawning AI through using http://haveibeentrained.com/ and we would like to learn more about how Spawning AI built the community around the tool. Beyond images from artists, it'll be good to share if Spawning has plans to extend the endevour to content creators (text/audios) or corpora creators? We would like to invite you to give a talk on any aspect(s) of Open Source Software created by Spawning AI. Your experience with building the artist/creator community will be very valuable to the workshop that could be shared with the ACL community at the NLP-OSS workshop. 6 | 7 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. Previous editions of NLP-OSS workshop have hosted creators and maintainers of popular open source software, e.g. Huggingface Transformers, Marian NMT, Stanford CoreNLP, scikit-learn. 8 | 9 | More details about the of the workshop can be found on https://nlposs.github.io/ 10 | 11 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2023! 12 | 13 | Thank you in advance for your response. 14 | 15 | Best Regards, 16 | NLP-OSS Organizers 17 | -------------------------------------------------------------------------------- /Speaker-Invite-2.md: -------------------------------------------------------------------------------- 1 | 2 | Dear Harrison, 3 | 4 | We are excited to invite you as our guest speaker for the 3rd Workshop for NLP Open Source Software (NLP-OSS) in 2023, co-located with EMNLP at Singapore on 06 December. 5 | 6 | We get to know about LangChain through using the ChatGPT through the Langchain API. LangChain has gain popularity and its connection OpenAPI and Huggingface has been a great tools for developers and many ACL/EMNLP researchers would like to learn more about the tools and convenience that LangChain provides. We would like to invite you to give a talk about how LangChain started and scaled to the state of how it is today with 2 millions downloads to date! Your experience with building the LangChain API will be very valuable to the workshop that could be shared with the ACL community at the NLP-OSS workshop. 7 | 8 | The NLP-OSS workshop is unlike the normal scientific track of academic workshops, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. Previous editions of NLP-OSS workshop have hosted creators and maintainers of popular open source software, e.g. Huggingface Transformers, Marian NMT, Stanford CoreNLP, scikit-learn. 9 | 10 | More details about the of the workshop can be found on https://nlposs.github.io/ 11 | 12 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2023! 13 | 14 | Thank you in advance for your response. 15 | 16 | Best Regards, NLP-OSS Organizers 17 | -------------------------------------------------------------------------------- /Speaker-Invite-3.md: -------------------------------------------------------------------------------- 1 | 2 | Dear Thomas, 3 | 4 | We are excited to re-invite you or anyone from HuggingFace as our guest speaker for the 3rd Workshop for NLP Open Source Software (NLP-OSS) in year 2023, colocated with EMNLP at Singapore in 06 December. 5 | 6 | It has been a quite a journey since `pytorch-pretrained-bert` and your talk at our last workshop. We would like to invite you to give a talk on any aspect(s) of Open Source Software and the HuggingFace suite of libraries or any other topics on LLMs OSS that you would like to share about. Your experience with building HuggingFace to the scale today is very valuable to the workshop that could be shared with the ACL community at the NLP-OSS workshop. 7 | 8 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. Previous editions of NLP-OSS workshop have hosted creators and maintainers of popular open source software, e.g. Huggingface Transformers, Marian NMT, Stanford CoreNLP, scikit-learn. 9 | 10 | More details about the of the workshop can be found on https://nlposs.github.io/ 11 | 12 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2023! 13 | 14 | Thank you in advance for your response. 15 | 16 | Best Regards, NLP-OSS Organizers 17 | -------------------------------------------------------------------------------- /Speaker-Invite-OAI.md: -------------------------------------------------------------------------------- 1 | Dear OpenAI co-founders and executives, 2 | 3 | We are excited to invite you as our guest speaker(s) for the 3rd Workshop for NLP Open Source Software (NLP-OSS) in year 2023, colocated with EMNLP at Singapore in 06 December. 4 | 5 | ChatGPT has brought about fresh perspectives on large language models, NLP applications and sparked a cambrian explosion of open source NLP tools. Little can our words describe the impact OpenAI has on the NLP academia and industry, and it would be good to hear from OpenAI about their journey from pre-GPT, to open source GPT-2 to the GPT-4 we know and use today, and the limits and risks of open sourcing OpenAI products. Your experience with building the an NLP startup will be inspiring and very valuable to the workshop that could be shared with the ACL community at the NLP-OSS workshop. 6 | 7 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. Previous editions of NLP-OSS workshop have hosted creators and maintainers of popular open source software, e.g. Huggingface Transformers, Marian NMT, Stanford CoreNLP, scikit-learn. 8 | 9 | More details about the of the workshop can be found on https://nlposs.github.io/ 10 | 11 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2023! 12 | 13 | Thank you in advance for your response. 14 | 15 | Best Regards, NLP-OSS Organizers 16 | -------------------------------------------------------------------------------- /Sponsorship-Invite-Template.md: -------------------------------------------------------------------------------- 1 | We would like to invite you as sponsor of the Workshop for NLP Open Source Software (NLP-OSS) in ` () in , `. 2 | 3 | The Workshop for NLP-OSS is the gathering of Natural Language Processing (NLP) researchers and practitioners researching, developing and using Open Source Softwares (OSS). Our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing, and maintaining NLP-OSS which we seldom talk about in scientific publications. 4 | 5 | Your sponsorship helps keep NLP-OSS sustainable to widest possible audience. The NLP-OSS workshop is organized by volunteers from both academia and industry. Sponsorship goes to covering the cost of invited speakers, and the benefits of sponsorship is as follows: 6 | 7 | - Show support of the NLP-OSS community 8 | - Brand awareness and recognition 9 | - Talent acquisition and recruitment to researchers and practitioners 10 | 11 | Depending on your level of sponsorship, packages include keynote seat drop, suggesting an invited talk, logo on webpage and plenary session branding. 12 | 13 | **Sponsorship benefits (Details):** 14 | 15 | | Gold | Silver | Bronze | 16 | |:-|:-|:-| 17 | | - Suggest an invited talk (20 mins)
- Keynote seat drop (materials provided by sponsors)
- Get list of papers authors and workshop participants (who permit to share their names and contacts in post-workshop survey)
- Company logo on website | - Keynote seat drop (materials provided by sponsors)
- Company logo on website | - Company logo on website | 18 | 19 | 20 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP-OSS community together! If you require more information about the sponsorship, feel free to contact us at nlposs.workshop@gmail.com 21 | 22 | We look forward to hearing from you! 23 | 24 | Regards from organizers, 25 | 26 | Lucy Park, NAVER Corp. 27 | 28 | Masato Hagiwara, Octanove Labs 29 | 30 | Nelson Liu, Stanford University 31 | 32 | Dmitrijs Milajevs, NIST 33 | 34 | -------------------------------------------------------------------------------- /decisions.md: -------------------------------------------------------------------------------- 1 | Dear {{fullname}}, 2 | 3 | On behalf of the NLP-OSS 2023 workshop Program Committee, I am delighted to inform you that the following submission has been accepted to appear at the workshop: 4 | 5 | {{submission_title}} 6 | 7 | The Program Committee worked very hard to thoroughly review all the submitted papers. Please repay their efforts, by following their suggestions when you revise your paper. You can find the final reviews for your paper on the submission page in OpenReview at: {{forum_url}} 8 | 9 | **IMPORTANT:** Please take into considerations the suggestions / TODOs made and questions raised by the reviewers and address them in the final submission! 10 | 11 | When you have revised your paper, you can upload your final manuscript on your submission's openreview page {{forum_url}}, click on "Edit > Submission" button on the top right (under the "PDF" button. 12 | 13 | We will follow up with several emails before the workshop to confirm your in-person/virtual participation and announce the program schedule for the workshop in the upcoming weeks. 14 | 15 | If you have any additional questions, please feel free to get in touch. 16 | 17 | Best Regards, 18 | NLP-OSS 2023 Organizers 19 | 20 | 21 | ----- 22 | 23 | 24 | Dear {{fullname}}, 25 | 26 | I am sorry to inform you that the following submission was not selected by the program committee to appear at NLP-OSS 2023: 27 | 28 | {{submission_title}} 29 | 30 | The selection process was competitive. Due to time and space limitations, we could only choose a small number of the submitted papers to appear on the program. Nonetheless, we still hope you can attend the conference. 31 | 32 | We have enclosed the reviewer comments for your perusal. You can find the final reviews for your paper on the submission page in OpenReview at: {{forum_url}} 33 | 34 | If you have any additional questions, please feel free to get in touch. 35 | 36 | Best Regards, 37 | NLP-OSS 2023 Organizers 38 | 39 | -------------------------------------------------------------------------------- /nlposs-2018/CFP-2018.md: -------------------------------------------------------------------------------- 1 | CALL FOR PAPERS 2 | ==== 3 | 4 | -------------------------------------------------------------------------------- 5 | *Workshop for NLP Open Source Software (NLP-OSS)* 6 | 20 July 2018, Co-located with ACL 2018 7 | https://nlposs.github.io/ 8 | 9 | Deadline for Long and Short Paper submission: 25th March 2018 10 | (23:59, GMT-11) 11 | --------------------------------------------------------------------------------- 12 | 13 | The First Workshop for NLP Open Source Software (NLP-OSS) will be co-located 14 | with ACL 2018 at Melbourne, Australia on 20 July 2018. 15 | 16 | Focusing more on the social and engineering aspect of NLP software 17 | and less on scientific novelty or state-of-art models, the Workshop for NLP-OSS 18 | is an academic forum to advance open source developments for NLP research, 19 | teaching and application. 20 | 21 | NLP-OSS also provides an academic workshop to announce new software/features, 22 | promote the collaborative culture and best practices that go beyond the conferences. 23 | 24 | We invite full papers (8 pages) or short papers (4 pages) on topics related to 25 | NLP-OSS broadly categorized into (i) software development, (ii) scientific 26 | contribution and (iii) NLP-OSS case studies. 27 | 28 | - **Software Development** 29 | - Designing and developing NLP-OSS 30 | - Licensing issues in NLP-OSS 31 | - Backwards compatibility and stale code in NLP-OSS 32 | - Growing an NLP-OSS community 33 | - Maintaining and motivating an NLP-OSS community 34 | - Best practices for NLP-OSS documentation and testing 35 | - Contribution to NLP-OSS without coding 36 | - Incentivizing OSS contributions in NLP 37 | - Commercialization and Intellectual Property of NLP-OSS 38 | - Defining and managing NLP-OSS project scope 39 | - Issues in API design for NLP 40 | - NLP-OSS software interoperability 41 | - Analysis of the NLP-OSS community 42 | 43 | - **Scientific Contribution** 44 | - Surveying OSS for specific NLP task(s) 45 | - Demonstration and tutorial of NLP-OSS 46 | - New NLP-OSS introductions 47 | - Small but useful NLP-OSS 48 | - NLP components in ML OSS 49 | - Citations and references for NLP-OSS 50 | - OSS vs experiment replicability 51 | - Gaps between existing NLP-OSS 52 | - Task-generic vs task-specific software 53 | 54 | - **Case studies** 55 | - Case studies of how a specific bug is fixed or feature is added 56 | - Writing wrappers for other NLP-OSS 57 | - Writing open-source APIs for open data 58 | - Teaching NLP with OSS 59 | - NLP-OSS in the industry 60 | 61 | Submission should be formatted according to the [ACL2018 templates](http://acl2018.org/call-for-papers/) 62 | Softconf site: https://www.softconf.com/acl2018/NLPOSS 63 | 64 | ORGANIZERS 65 | 66 | Lucy Park, NAVER Corp. 67 | Masato Hagiwara, Duolingo Inc. 68 | Dmitrijs Milajevs, NIST and Queen Mary University of London 69 | Liling Tan, Rakuten Institute of Technology 70 | -------------------------------------------------------------------------------- /nlposs-2018/Proposal-2018.md: -------------------------------------------------------------------------------- 1 | Workshop for Natural Language Processing Open Source Software (NLP-OSS) 2 | ==== 3 | 4 | 5 | With great scientific breakthrough comes solid engineering and open communities. The Natural Language Processing (NLP) community has benefited greatly from the open culture in sharing knowledge, data, and software. The primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing, and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. Our secondary goal is to promote synergies between different open source projects and encourage cross-software collaborations and comparisons. 6 | 7 | We refer to Natural Language Processing OSS as an umbrella term that not only covers traditional syntactic, semantic, phonetic, and pragmatic applications; we extend the definition to include task-specific applications (e.g., machine translation, information retrieval, question-answering systems), low-level string processing that contains valid linguistic information (e.g. Unicode creation for new languages, language-based character set definitions) and machine learning/artificial intelligence frameworks with functionalities focusing on text applications. 8 | 9 | There are many workshops focusing open language resource/annotation creation and curation (e.g. BUCC, GWN, LAW, LOD, WAC). Moreover, we have the flagship LREC conference dedicated to linguistic resources. However, the engineering aspects of NLP OSS is overlooked and under-discussed within the community. There are open source conferences and venues (such as FOSDEM, OSCON, Open Source Summit) where discussions range from operating system kernels to air traffic control hardware but the representation of NLP related presentations is limited. In the Machine Learning (ML) field, the Journal of Machine Learning Research - Machine Learning Open Source Software (JMLR-MLOSS) is a forum for discussions and dissemination of ML OSS topics. We envision that the Workshop for NLP-OSS becomes a similar avenue for NLP OSS discussions. 10 | 11 | To our best knowledge, this is the first workshop proposal in the recent years that focuses more on the building aspect of NLP and less on scientific novelty or state-of-art development. A decade ago, there was the SETQA-NLP (Software Engineering, Testing, and Quality Assurance for Natural Language Processing) workshop that raised awareness of the need for good software engineering practices in NLP. In the earlier days of NLP, linguistic software was often monolithic and the learning curve to install, use, and extend the tools was steep and frustrating. More often than not, NLP OSS developers/users interact in siloed communities within the ecologies of their respective projects. In addition to engineering aspects of NLP software, the open source movement has brought a community aspect that we often overlook in building impactful NLP technologies. 12 | 13 | One example of NLP OSS synergy is NLTK’s support for Stanford NLP tools which provide a Pythonic interface to the Stanford tools written in Java. More recently, the REST-ful API from Stanford CoreNLP tools has alleviated a host of issues that are related to cross-OSS interfaces in NLTK (c.f. https://github.com/nltk/nltk/pull/1249). The developers have also interacted across their respective code repositories to raise issues and give code reviews. Beyond the diamond-sharpening effect of cross-OSS collaborations, the result of the successful interface between the tools opens the door to easily benchmark annotations created by NLTK and Stanford CoreNLP. 14 | 15 | Another example of precious OSS knowledge comes from SpaCy developer [Montani (2017)](https://ines.io/blog/spacy-commercial-open-source-nlp), who shared her thoughts and challenges of maintaining commercial NLP OSS, such as handling open issues on the issue tracker, model release and packaging strategy and monetizing NLP OSS for sustainability. 16 | 17 | We hope that the NLP-OSS workshop becomes the intellectual forum to collate this type of knowledge, announce new software/features, promote the open source culture and best practices that go beyond the conferences. 18 | 19 | 20 | ## Call for Papers 21 | 22 | We invite full papers (8 pages) or short papers (4 pages) on topics related to 23 | NLP-OSS broadly categorized into (i) software development, (ii) scientific 24 | contribution and (iii) NLP-OSS case studies. 25 | 26 | - **Software Development** 27 | - Designing and developing NLP-OSS 28 | - Licensing issues in NLP-OSS 29 | - Backwards compatibility and stale code in NLP-OSS 30 | - Growing an NLP-OSS community 31 | - Maintaining and motivating an NLP-OSS community 32 | - Best practices for NLP-OSS documentation and testing 33 | - Contribution to NLP-OSS without coding 34 | - Incentivizing OSS contributions in NLP 35 | - Commercialization and Intellectual Property of NLP-OSS 36 | - Defining and managing NLP-OSS project scope 37 | - Issues in API design for NLP 38 | - NLP-OSS software interoperability 39 | - Analysis of the NLP-OSS community 40 | 41 | - **Scientific Contribution** 42 | - Surveying OSS for specific NLP task(s) 43 | - Demonstration and tutorial of NLP-OSS 44 | - New NLP-OSS introductions 45 | - Small but useful NLP-OSS 46 | - NLP components in ML OSS 47 | - Citations and references for NLP-OSS 48 | - OSS vs experiment replicability 49 | - Gaps between existing NLP-OSS 50 | - Task-generic vs task-specific software 51 | 52 | - **Case studies** 53 | - Case studies of how a specific bug is fixed or feature is added 54 | - Writing wrappers for other NLP-OSS 55 | - Writing open-source APIs for open data 56 | - Teaching NLP with OSS 57 | - NLP-OSS in the industry 58 | 59 | 60 | 61 | ## Organizers 62 | 63 | - [Lucy Park](https://github.com/e9t), NAVER Corp. 64 | 65 | Lucy is a research scientist at NAVER. During her Ph.D. studies at Seoul National University, she investigated better representations for text, and started several open source projects such as [KoNLPy](http://konlpy.org). Her current research area is machine translation -- focused on user log analysis and multilingual NMT. 66 | 67 | - [Masato Hagiwara](http://masatohagiwara.net/), Duolingo Inc. 68 | 69 | Masato Hagiwara is a research scientist / software engineer working at Duolingo. He received his Ph.D. degree in Information Science from Nagoya University in 2009. During his Ph.D., he worked at Google and Microsoft Research as an intern, and thereafter at Baidu Japan, focusing on search engine-related language processing research. Before joining Duolingo, he was a lead scientist at Rakuten Institute of Technology in New York. His research interests include Japanese and Chinese language processing, machine translation / transliteration, and language education. He received several paper awards from Japanese domestic conferences for his work on knowledge acquisition and transliteration. He also co-organized a couple of workshops and special sessions in Japan on noisy text processing. 70 | 71 | - [Dmitrijs Milajevs](http://www.eecs.qmul.ac.uk/~dm303/), NIST and Queen Mary University of London. dimazest@gmail.com 72 | 73 | Dmitrijs is a guest researcher at National Institute of Standards and Technology (NIST) working on the evaluation of information retrieval systems. He has just defended a PhD thesis on evaluation of compositional models in distributional semantics. He was responsible for a website for the [11th International Conference on Computational Semantics (IWCS 2015)](http://iwcs2015.github.io/) and a collocated hackathon. 74 | 75 | - [Liling Tan](https://github.com/alvations), Rakuten Institute of Technology 76 | 77 | Liling is a research scientist at Rakuten Institute of Technology working on Machine Translation and developing applications using language technologies. He has been actively involved in corpora creation/maintenance, Asian NLP and machine translation. He co-organized the [Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2014-16)](http://ttg.uni-saarland.de/vardial2016/) and the DSL shared task. 78 | 79 | 80 | ## Invited Speakers (Confirmed, depends on availability of speakers after conference assignment) 81 | 82 | - [Christopher Manning](https://nlp.stanford.edu/manning/), Stanford University 83 | - [Matthew Honnibal and Ines Montani](https://explosion.ai), Explosion AI 84 | - [Joel Nothman](http://joelnothman.com), University of Sydney 85 | 86 | 87 | ## Programme Committee (Hopeful list) 88 | 89 | - [Martin Andrews](http://mdda.net), Red Cat Labs 90 | - [Francis Bond](http://www.ntu.edu.sg/home/fcbond/), Nanyang Technological University 91 | - [Jason Baldridge](http://www.jasonbaldridge.com/), Google 92 | - [Steven Bethard](http://bethard.faculty.arizona.edu/), University of Arizona 93 | - [Fred Blain](https://fredblain.org), University of Sheffield 94 | - James Bradbury, Salesforce Research 95 | - [Denny Britz](http://blog.dennybritz.com/about/), Prediction Machines 96 | - [Marine Carpuat](http://www.cs.umd.edu/~marine/), University of Maryland 97 | - [Kyunghyun Cho](http://www.kyunghyuncho.me/), New York University 98 | - [Grzegorz Chrupała](http://grzegorz.chrupala.me/), Tilburg University 99 | - [Hal Daumé III](https://www.umiacs.umd.edu/~hal/), University of Maryland 100 | - [Jon Dehdari](http://jon.dehdari.org), Think Big Analytics 101 | - [Christian Federmann](http://www.cfedermann.de), Microsoft Research 102 | - [Mary Ellen Foster](http://www.dcs.gla.ac.uk/~mefoster/), University of Glasgow 103 | - [Michael Wayne Goodman](https://linguistics.washington.edu/people/michael-wayne-goodman), University of Washington 104 | - [Arwen Twinkle Griffioen](https://www.linkedin.com/in/arwengriffioen/), Zendesk Inc. 105 | - [Joel Grus](http://joelgrus.com/), Allen Institute for Artificial Intelligence 106 | - [Chris Hokamp](https://github.com/chrishokamp), Aylien Inc. 107 | - [Matthew Honnibal](https://explosion.ai), Explosion AI 108 | - [Sung Kim](https://www.cse.ust.hk/~hunkim/), Hong Kong University of Science and Technology 109 | - [Philipp Koehn](http://www.cs.jhu.edu/~phi/), Johns Hopkins University 110 | - [Taku Kudo](http://chasen.org/~taku/index.html.en), Google 111 | - [Christopher Manning](https://nlp.stanford.edu/manning/), Stanford University 112 | - [Diana Maynard](http://www.dcs.shef.ac.uk/~diana/), University of Sheffield 113 | - [Tomas Mikolov](https://research.fb.com/people/mikolov-tomas/), Facebook AI Research (FAIR) 114 | - [Ines Montani](https://ines.io), Explosion AI 115 | - [Andreas Müller](http://amueller.github.io), Columbia University 116 | - [Graham Neubig](http://www.phontron.com/), Carnegie Mellon University 117 | - [Vlad Niculae](http://vene.ro), Cornell CIS 118 | - [Joel Nothman](http://www.joelnothman.com), University of Sydney 119 | - [Matt Post](http://cs.jhu.edu/~post/), Johns Hopkins University 120 | - [David Przybilla](http://alejandro.pictures), Idio 121 | - [Amandalynne Paullada](https://linguistics.washington.edu/people/amandalynne-paullada), University of Washington 122 | - [Delip Rao](http://deliprao.com), Joostware AI Research Corp 123 | - [Radim Řehůřek](https://radimrehurek.com/), RaRe Technologies 124 | - [Elijah Rippeth](https://erip.github.io), MITRE Corporation 125 | - [Carolina Scarton](http://staffwww.dcs.shef.ac.uk/people/C.Scarton/), University of Sheffield 126 | - [Abigail See](http://cs.stanford.edu/people/abisee/), Stanford University 127 | - [Rico Sennrich](http://homepages.inf.ed.ac.uk/rsennric/), University of Edinburgh 128 | - [Dan Simonson](http://thedansimonson.com), Georgetown University 129 | - [Vered Shwartz](http://u.cs.biu.ac.il/~havivv/), Bar-Ilan University 130 | - [Ian Soboroff](https://www.nist.gov/people/ian-soboroff), NIST 131 | - [Pontus Stenetorp](http://pontus.stenetorp.se), University College London 132 | - [Rachael Tatman](http://rachaeltatman.com), Kaggle 133 | - [Tommaso Teofili](http://people.apache.org/~tommaso/), Adobe 134 | - [Emiel van Miltenburg](http://www.emielvanmiltenburg.nl), Vrije Universiteit Amsterdam 135 | - [Gaël Varoquaux](http://gael-varoquaux.info), INRIA 136 | - [Marcos Zampieri](http://uni-koeln.de/~mzampie2/index.html), University of Wolverhampton 137 | - [Maarten van Gompel](https://proycon.anaproy.nl/), Radboud University 138 | 139 | 140 | 146 | 147 | ## Other Required Infomation 148 | 149 | **Estimated no. of Attendees**: 50 150 | 151 | **Shared Task**: No 152 | 153 | **Special Requirements / Technical Needs**: No 154 | 155 | **URL for Workshop Website**: https://nlposs.github.io/ 156 | 157 | 158 | 159 | 171 | 172 | 208 | -------------------------------------------------------------------------------- /nlposs-2018/schedule-2018.orderfile: -------------------------------------------------------------------------------- 1 | + 8:45--9:00 Loading Presentations to Computer 2 | + 9:00--9:05 Opening Remarks 3 | + 9:05--9:50 Invited Talk 1 (Joel Nothman) 4 | + 9:50--10:30 Lightning Presentation for Posters Session 1 5 | + 10:30--11:00 Coffee Break 6 | + 11:00--11:45 Poster Session 1 7 | 3 # AllenNLP: A Deep Semantic Natural Language Processing Platform 8 | 7 # Stop Word Lists in Free Open-source Software Packages 9 | 11 # Texar: A Modularized, Versatile, and Extensible Toolbox for Deep Text Generation 10 | 2 # The ACL Anthology 11 | 9 # The risk of sub-optimal use of Open Source NLP Software: UKB is inadvertently state-of-the-art in knowledge-based WSD 12 | + 12:00--14:00 Lunch 13 | + 14:00--14:45 Invited Talk 2 (Christopher Manning) 14 | + 14:45--15:30 Lightning Presentation for Posters 2 15 | + 15:30--16:00 Break 16 | + 16:00--16:45 Poster Session 2 17 | 8 # Baseline: A Library for Rapid Modeling, Experimentation and Development of Deep Learning Algorithms targeting NLP 18 | 5 # Extensible toolkit for distributed and mixed precision training of sequence-to-sequence models 19 | 4 # Integrating Multiple NLP Technologies into an Open-source Platform for Multilingual Media Monitoring 20 | 6 # The Annotated Transformer 21 | + 16:45--17:30 Invited Talk 3 (Matthew Honnibal and Ines Montani) 22 | + 17:30--1735 Closing Remarks 23 | -------------------------------------------------------------------------------- /nlposs-2020/CPF-One.md: -------------------------------------------------------------------------------- 1 | (apologies for cross-posting) 2 | 3 | ---------------------------------------------------------------- 4 | 5 | *Workshop for NLP Open Source Software (NLP-OSS)* 6 | 7 | 11 or 12 Nov 2020, Co-located with EMNLP 2020 8 | 9 | https://nlposs.github.io/2020 10 | 11 | Deadline for Long and Short Paper submission: 05 August 2020 12 | 13 | (23:59, GMT-11) 14 | 15 | ---------------------------------------------------------------- 16 | 17 | You have used NLP open source tools and bore grievances but found the solution after hours of coffee and computer staring. Share that at NLP-OSS and suggest how open source could change for the better (e.g. best practices, documentation, API design etc.) 18 | 19 | You came across an awesome SOTA system on NLP task X that once ruled the F1 score. But now the code is stale and it takes an dinosaur to understand the code. Share your experience at NLP-OSS and propose how to "replicate" these forgotten systems. 20 | 21 | You read an NMT paper with SOTA BLEU scores. But now the code is stale and it takes an dinosaur to understand the code. Share your experience at NLP-OSS and propose how to "replicate" these forgotten systems. 22 | 23 | You see this shiny *BERT from a blogpost, tried it to reproduce similar results on a different task and it just doesn't work on your dataset. You did some magic to the code and now it works. Show us how you did it! Though they're small tweaks, well-motivated and empirically test are valid submissions to NLP-OSS. 24 | 25 | You have tried 101 NLP tools and there's none that really do what you want. So you wrote your own shiny new package and made it open source. Tell us why your package better than the existing tools, how did you design the code? Is it going to be a one time thing? Or would you like to see thousands of people using it? 26 | 27 | You are tired of your processing routine of using bash to call python to call perl and then output to a file that you use shell commands to munge the data. So you wrote a better data flow system to pipeline the task. Tell us why your package better than the existing tools, how did you design the code? Is it going to be a one time thing? Or would you like to see thousands of people using it? 28 | 29 | At last, you've found the avenue to air these issues in an academic platform at the NLP-OSS workshop!!! 30 | 31 | Sharing your experiences, suggestions and analysis from/of NLP-OSS 32 | 33 | 34 | P/S: 35 | 36 | 1st CALL FOR PAPERS 37 | ==== 38 | 39 | ---------------------------------------------------------------- 40 | 41 | *Workshop for NLP Open Source Software (NLP-OSS)* 42 | 43 | 11 or 12 Nov 2018, Co-located with EMNLP 2020 44 | 45 | https://nlposs.github.io/2020 46 | 47 | Deadline for Long and Short Paper submission: 05 August 2020 48 | 49 | (23:59, GMT-11) 50 | 51 | ---------------------------------------------------------------- 52 | 53 | The Second Workshop for NLP Open Source Software (NLP-OSS) will be co-located 54 | with EMNLP 2020 on 12 Nov 2020. 55 | 56 | Focusing more on the social and engineering aspect of NLP software 57 | and less on scientific novelty or state-of-art models, the Workshop for NLP-OSS 58 | is an academic forum to advance open source developments for NLP research, 59 | teaching and application. 60 | 61 | NLP-OSS also provides an academic workshop to announce new software/features, 62 | promote the collaborative culture and best practices that go beyond the conferences. 63 | 64 | We invite full papers (8 pages) or short papers (4 pages) on topics related to 65 | NLP-OSS broadly categorized into (i) software development, (ii) scientific 66 | contribution and (iii) NLP-OSS case studies. 67 | 68 | - **Software Development** 69 | - Designing and developing NLP-OSS 70 | - Licensing issues in NLP-OSS 71 | - Backwards compatibility and stale code in NLP-OSS 72 | - Growing, maintaining and motivating an NLP-OSS community 73 | - Best practices for NLP-OSS documentation and testing 74 | - Contribution to NLP-OSS without coding 75 | - Incentivizing OSS contributions in NLP 76 | - Commercialization and Intellectual Property of NLP-OSS 77 | - Defining and managing NLP-OSS project scope 78 | - Issues in API design for NLP 79 | - NLP-OSS software interoperability 80 | - Analysis of the NLP-OSS community 81 | 82 | - **Scientific Contribution** 83 | - Surveying OSS for specific NLP task(s) 84 | - Demonstration, introductions and/or tutorial of NLP-OSS 85 | - Small but useful NLP-OSS 86 | - NLP components in ML OSS 87 | - Citations and references for NLP-OSS 88 | - OSS and experiment replicability 89 | - Gaps between existing NLP-OSS 90 | - Task-generic vs task-specific software 91 | 92 | 93 | - **Case studies** 94 | - Case studies of how a specific bug is fixed or feature is added 95 | - Writing wrappers for other NLP-OSS 96 | - Writing open-source APIs for open data 97 | - Teaching NLP with OSS 98 | - NLP-OSS in the industry 99 | 100 | Submission should be formatted according to the [EMNLP 2020 templates](https://2020.emnlp.org/call-for-papers) 101 | 102 | 103 | ORGANIZERS 104 | 105 | Lucy Park, NAVER Corp. 106 | Masato Hagiwara, Octanove Labs LLC 107 | Dmitrijs Milajevs, NIST and Queen Mary University of London 108 | Nelson Liu, Stanford University 109 | Geeticka Chauhan, Massachusetts Institute of Technology 110 | Liling Tan, Rakuten Institute of Technology 111 | -------------------------------------------------------------------------------- /nlposs-2020/CPF-two.md: -------------------------------------------------------------------------------- 1 | (apologies for cross-posting) 2 | 3 | ---------------------------------------------------------------- 4 | 5 | *Workshop for NLP Open Source Software (NLP-OSS)* 6 | 7 | 11 or 12 Nov 2020, Co-located with EMNLP 2020 8 | 9 | https://nlposs.github.io/2020 10 | 11 | Deadline for Long and Short Paper submission: 05 August 2020 12 | 13 | (23:59, Anywhere on earth) 14 | 15 | ---------------------------------------------------------------- 16 | 17 | You have used NLP open source tools and bore grievances but found the solution after hours of coffee and computer staring. Share that at NLP-OSS and suggest how open source could change for the better (e.g. best practices, documentation, API design, etc.) 18 | 19 | You are tired of Zoom sessions at ACL but got inspired to write an open source for a nifty idea you got from ACL 2020. Share what you've built at NLP-OSS and what motivated you other than distracting yourself from attending Zoom QnAs; understanding about motivations of OSS forms a larger understanding of OSS sustainability in general. 20 | 21 | You came across an awesome SOTA system on NLP task X that once ruled the F1 score. But now the code is stale and it's difficult to understand the code and reproduce the results. Share your experience at NLP-OSS and propose how to "replicate" these forgotten systems. 22 | 23 | You see this shiny *BERT from a blogpost, tried it to reproduce similar results on a different task and it just doesn't work on your dataset. You did some magic to the code and now it works. Show us how you did it! Though they're small tweaks, well-motivated and empirically tested experiments are valid submissions to NLP-OSS. 24 | 25 | You have tried 101 NLP tools and there's none that really does what you want. So you wrote your shiny new package and made it open source. Tell us why your package is better than the existing tools, how did you design the code? Is it going to be a one-time thing? Or would you like to see thousands of people using it? 26 | 27 | You are tired of your processing routine of using bash to call Python to call Perl and then output to a file that you use shell commands to munge the data. So you wrote a better data flow system to pipeline the task. Tell us why your package is better than the existing tools, how did you design the code? Is it going to be a one-time thing? Or would you like to see thousands of people using it? 28 | 29 | At last, you've found the avenue to air these issues in an academic platform at the NLP-OSS workshop!!! 30 | 31 | Sharing your experiences, suggestions and analysis from/of NLP-OSS 32 | 33 | P/S: 34 | 35 | 2nd CALL FOR PAPERS 36 | ==== 37 | 38 | ---------------------------------------------------------------- 39 | 40 | *Workshop for NLP Open Source Software (NLP-OSS)* 41 | 42 | 11 or 12 Nov 2018, Co-located with EMNLP 2020 43 | 44 | https://nlposs.github.io/2020 45 | 46 | [EXTENDED] Deadline for Long and Short Paper submission: 19 August 2020 47 | 48 | (23:59, GMT-11) 49 | 50 | ---------------------------------------------------------------- 51 | 52 | The Second Workshop for NLP Open Source Software (NLP-OSS) will be co-located 53 | with EMNLP 2020 on 12 Nov 2020. 54 | 55 | Focusing more on the social and engineering aspect of NLP software 56 | and less on scientific novelty or state-of-art models, the Workshop for NLP-OSS 57 | is an academic forum to advance open source developments for NLP research, 58 | teaching and application. 59 | 60 | NLP-OSS also provides an academic workshop to announce new software/features, 61 | promote the collaborative culture and best practices that go beyond the conferences. 62 | 63 | We invite full papers (8 pages) or short papers (4 pages) on topics related to 64 | NLP-OSS broadly categorized into (i) software development, (ii) scientific 65 | contribution and (iii) NLP-OSS case studies. 66 | 67 | - **Software Development** 68 | - Designing and developing NLP-OSS 69 | - Licensing issues in NLP-OSS 70 | - Backward compatibility and stale code in NLP-OSS 71 | - Growing, maintaining and motivating an NLP-OSS community 72 | - Best practices for NLP-OSS documentation and testing 73 | - Contribution to NLP-OSS without coding 74 | - Incentivizing OSS contributions in NLP 75 | - Commercialization and Intellectual Property of NLP-OSS 76 | - Defining and managing NLP-OSS project scope 77 | - Issues in API design for NLP 78 | - NLP-OSS software interoperability 79 | - Analysis of the NLP-OSS community 80 | 81 | - **Scientific Contribution** 82 | - Surveying OSS for specific NLP task(s) 83 | - Demonstration, introductions and/or tutorial of NLP-OSS 84 | - Small but useful NLP-OSS 85 | - NLP components in ML OSS 86 | - Citations and references for NLP-OSS 87 | - OSS and experiment replicability 88 | - Gaps between existing NLP-OSS 89 | - Task-generic vs task-specific software 90 | 91 | - **Case studies** 92 | - Case studies of how a specific bug is fixed or feature is added 93 | - Writing wrappers for other NLP-OSS 94 | - Writing open-source APIs for open data 95 | - Teaching NLP with OSS 96 | - NLP-OSS in the industry 97 | 98 | Submission should be formatted according to the [EMNLP 2020 templates](https://2020.emnlp.org/call-for-papers) 99 | 100 | Due to the nature of open source software, we find it a bit tricky to "anonymize" "open source". 101 | For this reason, we don't require your publication to be anonymous. 102 | 103 | However, if you prefer your paper to be anonymized, please mask any identifiable phrase with REDACTED. 104 | 105 | We have an option setup in softconf so that you can explicitly opt-in / opt-out of anonymity. 106 | 107 | 108 | ORGANIZERS 109 | 110 | Lucy Park, NAVER Corp. 111 | Masato Hagiwara, Octanove Labs LLC 112 | Dmitrijs Milajevs, NIST and Queen Mary University of London 113 | Nelson Liu, Stanford University 114 | Geeticka Chauhan, Massachusetts Institute of Technology 115 | Liling Tan, Rakuten Institute of Technology 116 | -------------------------------------------------------------------------------- /nlposs-2020/PC-Invite-Template.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | We would like to invite you to join the Program Committee for the Workshop for NLP Open Source Software (NLP-OSS) in 2020. 4 | 5 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. 6 | 7 | More details about the first edition of the workshop workshop can be found on https://github.com/nlposs/NLP-OSS 8 | 9 | Your experience with _____ and various NLP OSS is very valuable to the workshop that could be shared with the open source NLP/MT communities. 10 | 11 | The committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. As a committee member, you can help us to review paper and promote the workshop by encouraging submissions and circulating information. Additionally, we have chosen to openly write the workshop proposal and website; we welcome contributions from you and anyone in the open community. 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | ACL | Seattle, Washington | 9-10 July 2020| 18 | | COLING | Barcelona, Spain | 13-14 September 2020 | 19 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 20 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 21 | 22 | We would be grateful if you could confirm your participation in the committee by replying to this email or post a comment on https://github.com/nlposs/NLP-OSS/issues/19. We hope that you confirm your participation soon. 23 | 24 | If you have other PC candidates that you would like to recommend for the workshop, please do contact us. 25 | 26 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 27 | 28 | Best Regards, 29 | NLP-OSS Organizers 30 | -------------------------------------------------------------------------------- /nlposs-2020/PC-Reinvite-Template.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | Thank you again for being part of the Program Committee in the first edition of Workshop for NLP Open Source Software (NLP-OSS) in 2018. The workshop would not have been possible without your contribution https://nlposs.github.io/! 4 | 5 | We would like to invite you to join the Program Committee for the Workshop for NLP Open Source Software (NLP-OSS) in 2020. 6 | 7 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. 8 | 9 | Your experience with various NLP OSS is very valuable to the workshop that could be shared with the open source NLP/MT communities. 10 | 11 | The committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. As a committee member, you can help us to review paper and promote the workshop by encouraging submissions and circulating information. Additionally, we have chosen to openly write the workshop proposal and website; we welcome contributions from you and anyone in the open community. 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | ACL | Seattle, Washington | 9-10 July 2020| 18 | | COLING | Barcelona, Spain | 13-14 September 2020 | 19 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 20 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 21 | 22 | We would be grateful if you could confirm your participation in the committee by replying to this email or post a comment on https://github.com/nlposs/NLP-OSS/issues/19. We hope that you confirm your participation soon. If we don't hear from you to step down from the PC role, we would automatically add you in as part of the PC in NLP-OSS 2020. 23 | 24 | If you have change your affiliations recently, please do reply to this email and we'll make the changes accordingly for the NLP-OSS 2020 website. 25 | 26 | If you have other PC candidates that you would like to recommend for the workshop, please do contact us. 27 | 28 | With your help, I'm sure we can make NLP-OSS a success and grow the NLP OSS community together! 29 | 30 | Best Regards, 31 | NLP-OSS Organizers 32 | -------------------------------------------------------------------------------- /nlposs-2020/PC-Updates-1.md: -------------------------------------------------------------------------------- 1 | Dear Program Committee Members, 2 | 3 | Thank you all again for accepting our invitation to the NLP-OSS 2020 Program Committee! 4 | 5 | The 2nd edition of NLP-OSS workshop would be held in collocated with EMNLP 2020 https://nlposs.github.io/2020/ on 11 or 12 November 2020. 6 | 7 | Some updates on the NLP-OSS organization as we proceed to send out the Call for Paper soon: 8 | 9 | - Firstly, we welcome Nelson Liu (Stanford University) and Geeticka Chauhan (Massachusetts Institute of Technology) as our new co-organizers for the workshop. They share the same passion for opens source, experimental replicability and sustainability in the NLP. 10 | 11 | - We welcome new PC members from our fellow PC members' and co-organizers contact as well as new invitations extended to the Widening NLP community through the [BIG directory](http://www.winlp.org) as well as our shortlisted members whom have expressed interest through our proposal survey. 12 | 13 | - We are excited to have all 3 invited speakers, Chip Huyen (previously from NVIDIA, working on varous NLP projects ranging from the SOTAWHAT, LazyNLP, OpenSeq2Seq, Nemo), Spencer Kelly (creator of the [Compromise Javascript NLP library](https://github.com/spencermountain/compromise)) and Thomas Wolf (from Huggingface working on the various open source libraries including the popular PyTorch Transformers). 14 | 15 | 18 | 19 | Administrative matters: 20 | 21 | - The **paper submission deadline is 01 July** and the **reviews will start 05 July** would be **due on 05 August**. 22 | 23 | - If the affiliations on the workshop site is wrong or if you find any typos on your name of affiliations or any part of the page, please do tell us https://nlposs.github.io/2020/index.html =) 24 | 25 | If you know of any potential work that is relevant to the workshop, we hope that you can help to encourage submissions and/or participation to the workshop to your local community. We'll be sending out the first call for paper very soon! 26 | 27 | Keep in touch! 28 | 29 | Best Regards,
30 | NLP-OSS Organizers 31 | -------------------------------------------------------------------------------- /nlposs-2020/PC-Updates-2.md: -------------------------------------------------------------------------------- 1 | Dear Program Committee Members, 2 | 3 | Thank you for again for accepting our invitation to the NLP-OSS 2020 Program Committee! 4 | 5 | The NLP-OSS workshop has been accepted to [EMNLP 2020](https://2020.emnlp.org/). Paper submissions will as soon as the EMNLP conference organizers open up the softconf system. 6 | 7 | As a committee member, you will help us to review paper and promote the workshop by encouraging submissions and circulating information. Additionally, we have chosen to be as open as possible in how we organize the workshop; we welcome contributions from you and anyone in the open community to improve the workshop. 8 | 9 | The important dates for NLP-OSS workshop are as follows. 10 | 11 | | Item | Dates | 12 | |:-|:-| 13 | | Paper submission | 01 July 2020 | 14 | | Paper Reviews Starts | 05 July 2020 | 15 | | Paper Reviews Due | 05 August 2020 | 16 | | Notification of Acceptance | 10 August 2020 | 17 | | Camera-Ready Version | 25 August 2020 | 18 | | Actual Workshop | 11 or 12 November 2020 | 19 | 20 | While the dates for review 05 July to 05 August is some time away, we hope that you encourage any submissions or participation to the workshop to your local community. 21 | 22 | We will keep the [NLP-OSS site](https://nlposs.github.io/2020/) updated and notify you through email for any news. 23 | 24 | Thank you for being part of the NLP-OSS Program Committee. We look forward to another frutiful workshop and hope to see you at the workshop too! 25 | 26 | Regards, 27 | NLP-OSS Organizers 28 | 29 | 30 | -------------------------------------------------------------------------------- /nlposs-2020/PC-reInvite-noreply-follow-up.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | Thank you again for being part of the Program Committee in the first edition of Workshop for NLP Open Source Software (NLP-OSS) in 2018! The workshop would not have been possible without your contribution https://nlposs.github.io/! 4 | 5 | We have sent you an invite to the 2020 workshop but we haven't heard from you since our invitation. 6 | 7 | **Could we check with you whether you would like to continue to join us as a program committee (PC) member for the workshop in 2020?** Your experience with various NLP OSS is very valuable to the workshop that could be shared with the open source NLP communities. 8 | 9 | **We would be grateful if you could confirm your participation in the committee by replying to this email or post a comment on https://github.com/nlposs/NLP-OSS/issues/19.** 10 | 11 | We have kept the workshop proposal up to date on https://github.com/nlposs/NLP-OSS 12 | 13 | As a committee member, you would help us to review paper and promote the workshop by encouraging submissions and circulating information. 14 | 15 | Please do give us any feedback about the workshop if you have any. Or any concerns about commitment as a PC member. 16 | 17 | Hope to hear from you soon! 18 | 19 | Best Regards, 20 | NLP-OSS Organizers 21 | -------------------------------------------------------------------------------- /nlposs-2020/PC-reInvite-noreply-followup-lastcheck-finalthankyou.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | Thank you again for being part of the Program Committee in the first edition of Workshop for NLP Open Source Software (NLP-OSS) in 2018! The workshop would not have been possible without your contribution https://nlposs.github.io/! 4 | 5 | We have sent you an invite to the 2020 workshop previously but we haven't heard from you since our invitation. The 2nd edition of NLP-OSS workshop would be held in collocated with EMNLP 2020 https://nlposs.github.io/2020/ on 11 or 12 November 2020. 6 | 7 | We understand that you might have other committments and would not participate in the workshop as a PC member. 8 | 9 | To avoid missing review deadlines and empty reviews, **we would remove you from the PC list since we do not hear from you by 03 March 2020.** 10 | 11 | If you have missed our previous emails and would still like to participate as a PC member, please do reply to this email to indicate your participation. Also, if you would like to keep in touch or be removed from our mailing list please do reply to this email to indicate your contact preferences. 12 | 13 | If you know of any potential work that is relevant to the workshop, we hope that you can help to encourage submissions and/or participation to the workshop to your local community. We'll be sending out the first call for paper very soon! 14 | 15 | Please do give us any feedback about the workshop if you have any. 16 | 17 | Thank you again for being part of the Program Committee in NLP-OSS 2018! 18 | 19 | Best Regards,
20 | NLP-OSS Organizers 21 | -------------------------------------------------------------------------------- /nlposs-2020/PC-reInvite-noreply-followup-lastcheck.md: -------------------------------------------------------------------------------- 1 | Dear ____, 2 | 3 | Thank you again for being part of the Program Committee in the first edition of Workshop for NLP Open Source Software (NLP-OSS) in 2018! The workshop would not have been possible without your contribution https://nlposs.github.io/! 4 | 5 | We have sent you an invite to the 2020 workshop previously but we haven't heard from you since our invitation. The 2nd edition of NLP-OSS workshop would be held in collocated with EMNLP 2020 https://nlposs.github.io/2020/ on 11 or 12 November 2020. 6 | 7 | **Could we check with you whether you would like to continue to join us as a program committee (PC) member for the workshop in 2020?** Your experience with various NLP OSS is very valuable to the workshop that could be shared with the open source NLP communities. 8 | 9 | We would be grateful if you could confirm your participation in the committee by replying to this email or post a comment on https://github.com/nlposs/NLP-OSS/issues/19. **If you have change your affiliations recently, please do reply to this email** and we'll make the changes accordingly for the NLP-OSS 2020 website. 10 | 11 | As a committee member, you would help us to review paper and promote the workshop by encouraging submissions and circulating information. 12 | 13 | To avoid missing review deadlines and empty reviews, **we would remove you from the PC list if we do not hear from you by 03 March 2020.** In the event that you are unable to commit to the workshop PC, we understand that there are other commitments that you may be having but if you would like to keep in touch or be removed from our mailing list please do reply to this email to indicate your contact preferences. 14 | 15 | Please do give us any feedback about the workshop if you have any. Or any concerns about commitment as a PC member. 16 | 17 | Hope to hear from you soon! 18 | 19 | Best Regards,
20 | NLP-OSS Organizers 21 | -------------------------------------------------------------------------------- /nlposs-2020/Proposal-2020.md: -------------------------------------------------------------------------------- 1 | 2nd Workshop for Natural Language Processing Open Source Software (NLP-OSS) 2 | ==== 3 | 4 | 5 | With great scientific breakthrough comes solid engineering and open communities. The Natural Language Processing (NLP) community has benefited greatly from the open culture in sharing knowledge, data, and software. The primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing, and maintaining NLP open source software (OSS), which we seldom talk about in scientific publications. Our secondary goal is to promote synergies between different open source projects and encourage cross-software collaborations and comparisons. 6 | 7 | We refer to Natural Language Processing OSS as an umbrella term that not only covers traditional syntactic, semantic, phonetic, and pragmatic applications; we extend the definition to include task-specific applications (e.g., machine translation, information retrieval, question-answering systems), low-level string processing that contains valid linguistic information (e.g. Unicode creation for new languages, language-based character set definitions) and machine learning/artificial intelligence frameworks with functionalities focusing on text applications. 8 | 9 | There are many workshops focusing on the creation and curation of open language resources and annotations (e.g. BUCC, GWN, LAW, LOD, WAC). Moreover, we have the flagship LREC conference dedicated to linguistic resources. However, the engineering aspects of NLP-OSS are overlooked and under-discussed within the community. There are open source conferences and venues (such as FOSDEM, OSCON, Open Source Summit) where discussions range from operating system kernels to air traffic control hardware but the representation of NLP related presentations is limited. In the Machine Learning (ML) field, the Journal of Machine Learning Research - Machine Learning Open Source Software (JMLR-MLOSS) is a forum for discussions and dissemination of ML OSS topics. We envision that the Workshop for NLP-OSS becomes a similar avenue for NLP-OSS discussions. 10 | 11 | A decade ago, there was also the SETQA-NLP (Software Engineering, Testing, and Quality Assurance for Natural Language Processing) workshop that raised awareness of the need for good software engineering practices in NLP. In the earlier days of NLP, linguistic software was often monolithic and the learning curve to install, use, and extend the tools was steep and frustrating. More often than not, NLP-OSS developers/users interact in siloed communities within the ecologies of their respective projects. In addition to engineering aspects of NLP software, the open source movement has brought a community aspect that we often overlook in building impactful NLP technologies. 12 | 13 | One example of NLP-OSS synergy is NLTK's support for Stanford NLP tools which provide a Pythonic interface to the Stanford tools written in Java. More recently, the REST-ful API from Stanford CoreNLP tools has alleviated a host of issues that are related to cross-OSS interfaces in NLTK (c.f. https://github.com/nltk/nltk/pull/1249). The developers have also interacted across their respective code repositories to raise issues and give code reviews. Beyond the diamond-sharpening effect of cross-OSS collaborations, the result of the successful interface between the tools opens the door to easily benchmark annotations created by NLTK and Stanford CoreNLP. 14 | 15 | Another example of precious OSS knowledge comes from SpaCy developer [Montani (2017)](https://ines.io/blog/spacy-commercial-open-source-nlp), who shared her thoughts and challenges of maintaining commercial NLP-OSS, such as handling open issues on the issue tracker, model release and packaging strategy and monetizing NLP OSS for sustainability. More recently, there is also the [PyTorch Transformer](https://github.com/huggingface/pytorch-transformers) by Hugging Face, which has gathered much interest from the community by open sourcing implementations and pretrained weights of BERT-like models, in a clean and well-organized structure. The interoperability of various pretrained models trained with different tools in one library enables quick benchmarking across the models, as well as developing best practices for reading/saving serialized interoperable models. 16 | 17 | The first NLP-OSS workshop, which was co-located with ACL 2018, was the first workshop in recent years that focused more on building quality software for NLP, open sourcing, developing useful engineering practices, and less on scientific novelty or state-of-art development. We hope that the 2nd NLP-OSS workshop could also be hosted in an *ACL conference, to be an intellectual forum to collate this type of knowledge, announce new software/features, promote the open source culture and best practices that go beyond the conferences. 18 | 19 | 20 | ## Call for Papers 21 | 22 | We invite full papers (8 pages) or short papers (4 pages) on topics related to NLP-OSS broadly categorized into (i) software development, (ii) scientific contribution and (iii) NLP-OSS case studies. 23 | 24 | - **Software Development** 25 | - Designing and developing NLP-OSS 26 | - Licensing issues in NLP-OSS 27 | - Backwards compatibility and stale code in NLP-OSS 28 | - Growing, maintaining and motivating an NLP-OSS community 29 | - Best practices for NLP-OSS documentation and testing 30 | - Contribution to NLP-OSS without coding 31 | - Incentivizing OSS contributions in NLP 32 | - Commercialization and Intellectual Property of NLP-OSS 33 | - Defining and managing NLP-OSS project scope 34 | - Issues in API design for NLP 35 | - NLP-OSS software interoperability 36 | - Analysis of the NLP-OSS community 37 | 38 | - **Scientific Contribution** 39 | - Surveying OSS for specific NLP task(s) 40 | - Demonstration, introductions and/or tutorial of NLP-OSS 41 | - Small but useful NLP-OSS 42 | - NLP components in ML OSS 43 | - Citations and references for NLP-OSS 44 | - OSS and experiment replicability 45 | - Gaps between existing NLP-OSS 46 | - Task-generic vs task-specific software 47 | 48 | 49 | - **Case studies** 50 | - Case studies of how a specific bug is fixed or feature is added 51 | - Writing wrappers for other NLP-OSS 52 | - Writing open-source APIs for open data 53 | - Teaching NLP with OSS 54 | - NLP-OSS in the industry 55 | 56 | ## Invited Speakers 57 | 58 | - [Chip Huyen](https://huyenchip.com), NVIDIA 59 | - [Spencer Kelly](https://spencermounta.in/), Freelance Developer 60 | - [Thomas Wolf](https://thomwolf.io), Huggingface 61 | 62 | ## Demographic Diversity 63 | 64 | **Organizers:** We have 6 organizers with representation from industrial NLP/ML labs, consultancy labs and academic research. 65 | 66 | **PC members:** We strive to a have a balance of academic and industrial PC from diverse gender and geolocation demographics. On top of our existing PC members from NLP-OSS 2018, we've invited new PCs members by (i) recommendation of previous PC members and (ii) extending our invitation to a subset of WiNLP members on the BIG directory (we tried to contact as many as possible and in total we've sent to ~30 invitations to WiNLP members); if they have not replied and accepted the invitation, we have not listed in the PC list below. 67 | 68 | **Invited speakers**: Our invited speakers come from various backgrounds and have been involved with NLP open source development and may not have been that active within the ACL community. We strive to continue the tradition of inviting a diverse range of speakers to talk about NLP OSS to talk about a balance between OSS development, scientific contribution and community building around OSS. All three speakers have confirmed their participation to give an invited talk (depending on their availability after the exact conference assignment). 69 | 70 | To ensure diversity and inclusivity, we have drafted a [Code of Conduct](https://github.com/nlposs/NLP-OSS/blob/master/Code-of-Conduct.md) for the NLP-OSS workshop and community and we would put up the Code of Conduct on the NLP-OSS 2020 website once we have sought advice from the ACL Professional Conduct Committee. 71 | 72 | ## Misc 73 | 74 | **Estimated no. of Attendees**: 50 75 | 76 | **Shared Task**: No 77 | 78 | **Special Requirements / Technical Needs**: No 79 | 80 | **Preferred Venue**: 81 | 82 | 1. ACL 83 | 2. EMNLP 84 | 3. COLING 85 | 4. AACL/IJCNLP 86 | 87 | 88 | **Previous Workshop**: 89 | 90 | - [First Workshop for NLP-OSS at ACL 2018](https://nlposs.github.io) 91 | - 10 submissions, 9 accepted publications 92 | - 30+ attendees 93 | - 3 invited speakers 94 | 95 | **Expected no. of submissions**: 15-20 submissions 96 | 97 | ## Organizers 98 | 99 | - [Lucy Park](https://github.com/e9t), NAVER Corp. 100 | 101 | Lucy is a machine learning engineer at NAVER. She has participated in some open source projects, particularly [KoNLPy](http://konlpy.org) which is a tool for Korean NLP, and is also interested in open data. She received her Ph.D. in Data Mining from Seoul National University in 2016, where she has pursued various studies on text mining in the fields of manufacturing, political science, and multimedia. After her studies, she joined NAVER, a South Korea based search-engine company, and is currently working on machine translation for Papago. Her research interests include machine translation, multilingual text mining, and evaluation of machine learning algorithms. 102 | 103 | - [Masato Hagiwara](http://masatohagiwara.net/), Octanove Labs LLC 104 | 105 | Masato Hagiwara is an independent NLP/ML engineer and researcher at Octanove Labs. He received his Ph.D. degree in Information Science from Nagoya University in 2009. During his Ph.D., he worked at Google and Microsoft Research as an intern, and thereafter at Baidu Japan and Rakuten Institute of Technology, focusing on search engine-related language processing research. Most recently he was working as a Senior Machine Learning Engineer at Duolingo, focusing on educational applications of NLP. He received several paper awards from Japanese domestic conferences for his work on knowledge acquisition and transliteration. He also co-organized several workshops and shared tasks, including NLP-OSS 2018. 106 | 107 | - [Dmitrijs Milajevs](http://zest.id.lv/), KPMG LLP. 108 | 109 | Dmitrijs Milajevs is a data scientist at KMPG. Previously, he evaluated information retrieval systems at National Institute of Standards and Technology (NIST). He has defended a Ph.D. thesis on evaluation of compositional models in distributional semantics. 110 | 111 | - [Nelson Liu](https://cs.stanford.edu/~nfliu/), Stanford University. 112 | 113 | Nelson Liu is a computer science Ph.D. student at Stanford, where he works in the Stanford NLP Group. He has contributed to the AllenNLP, torchtext, and scikit-learn projects at various points in time. 114 | 115 | - [Geeticka Chauhan](https://www.csail.mit.edu/person/geeticka-chauhan), Massachusetts Institute of Technology 116 | 117 | Geeticka Chauhan is a Ph.D. student at MIT, working on NLP for healthcare advised by Prof. Peter Szolovits. Her master thesis focused on revealing the reproducibility and generalizability problems in Relation Extraction, and experimentally showed the importance of streamlining evaluation methods in NLP challenges 118 | 119 | 120 | - [Liling Tan](https://github.com/alvations), Rakuten Institute of Technology 121 | 122 | Liling is a research scientist at Rakuten Institute of Technology working on Machine Translation and developing applications using language technologies. He has been actively involved in corpora creation/maintenance, Asian NLP and machine translation. He co-organized the [Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2014-16)](http://ttg.uni-saarland.de/vardial2016/). 123 | 124 | 125 | ## Programme Committee 126 | 127 | - [Tareq Abdo Abdullah Al-Moslmi](https://www.uib.no/en/persons/Tareq.Abdo.Abdullah.Al-Moslmi), University of Bergen 128 | - [Martin Andrews](http://mdda.net), Red Cat Labs 129 | - [Eleftherios Avramidis](), Deutsches Forschungszentrum für Künstliche Intelligenz 130 | - Amittai Axelrod, DiDi Chuxing (Los Angeles) 131 | - Tim Baldwin, University of Melbourne 132 | - [Marcel Bollmann](https://marcel.bollmann.me/), University of Copenhagen 133 | - [Francis Bond](http://www.ntu.edu.sg/home/fcbond/), Nanyang Technological University 134 | - [Steven Bethard](http://bethard.faculty.arizona.edu/), University of Arizona 135 | - [Fred Blain](https://fredblain.org), University of Sheffield 136 | - James Bradbury, Salesforce Research 137 | - [Daniel Braun](http://wwwmatthes.in.tum.de), Technical University of Munich 138 | - [Denny Britz](http://blog.dennybritz.com/about/), Prediction Machines 139 | - [Muthu Kumar Chandrasekaran](http://wing.comp.nus.edu.sg/~cmkumar/), National University of Singapore 140 | - Sharat Chikkerur, Microsoft 141 | - [Kyunghyun Cho](http://www.kyunghyuncho.me/), New York University 142 | - Shamil Chollampatt, Rakuten Institute of Technology 143 | - [Jon Dehdari](http://jon.dehdari.org), Think Big Analytics 144 | - Steve DeNeefe, SDL 145 | - [Leon Derczynski](http://www.derczynski.com/itu/), IT University of Copenhagen 146 | - [Anca Dumitrache](http://ancad.ro/), FD Mediagroep 147 | - Gérard Dupont, Airbus 148 | - [Ignatius Ezeani](https://www.lancaster.ac.uk/scc/about-us/people/ignatius-ezeani), Lancaster University 149 | - [Marzieh Fadaee](https://staff.fnwi.uva.nl/m.fadaee/), University of Amsterdam 150 | - [Christian Federmann](http://www.cfedermann.de), Microsoft Research 151 | - [Mary Ellen Foster](http://www.dcs.gla.ac.uk/~mefoster/), University of Glasgow 152 | - [Michael Wayne Goodman](https://goodmami.org/), Nanyang Technological University 153 | - [Arwen Twinkle Griffioen](https://www.linkedin.com/in/arwengriffioen/), Zendesk Inc. 154 | - [Joel Grus](http://joelgrus.com/), Allen Institute for Artificial Intelligence 155 | - [Chris Hokamp](https://github.com/chrishokamp), AYLIEN 156 | - [Matthew Honnibal](https://explosion.ai), Explosion AI 157 | - [David M. Howcroft](https://davehowcroft.com/), 158 | - [Fabio Kepler](), Unbabel 159 | - [Sung Kim](https://www.cse.ust.hk/~hunkim/), Hong Kong University of Science and Technology 160 | - [Thomas Kober](https://www.inf.ed.ac.uk/people/staff/Thomas_Kober.html), University of Edinburgh 161 | - [Philipp Koehn](http://www.cs.jhu.edu/~phi/), Johns Hopkins University 162 | - [Taku Kudo](http://chasen.org/~taku/index.html.en), Google 163 | - Varun Kumar, Amazon Alexa 164 | - [Paul Pu Liang](http://www.cs.cmu.edu/~pliang/), Carnegie Mellon University 165 | - [Sandya Mannarswamy](https://scholar.google.co.in/citations?user=i27nd3oAAAAJ&hl=en), Independent Researcher 166 | - [Christopher Manning](https://nlp.stanford.edu/manning/), Stanford University 167 | - Laura Martinus, Explore Data Science Academy 168 | - Ehsan Khoddam Mohammadi, RELX 169 | - [Ines Montani](https://ines.io), Explosion AI 170 | - [Vlad Niculae](http://vene.ro), Instituto de Telecomunicações 171 | - [Joel Nothman](http://www.joelnothman.com), University of Sydney 172 | - Yves Peirsman, NLP Town 173 | - [Tommi A Pirinen](https://www.computing.dcu.ie/~tpirinen/), University of Hamburg 174 | - [Aline Paes](https://uff.academia.edu/AlinePaes), Universidade Federal Fluminense 175 | - [Matt Post](http://cs.jhu.edu/~post/), Johns Hopkins University 176 | - [David Przybilla](http://alejandro.pictures), Idio 177 | - [Amandalynne Paullada](https://linguistics.washington.edu/people/amandalynne-paullada), University of Washington 178 | - [Delip Rao](http://deliprao.com), AI Foundation 179 | - [Radim Řehůřek](https://radimrehurek.com/), RaRe Technologies 180 | - [Elijah Rippeth](https://erip.github.io), MITRE Corporation 181 | - [Mohd Sanad Zaki Rizvi](https://www.linkedin.com/in/mohd-sanad-zaki-rizvi-0238b5a6/), Analytics Vidhya 182 | - [Ivan Vladimir Meza Ruiz](http://turing.iimas.unam.mx/~ivanvladimir/), National University of Mexico 183 | - [Carolina Scarton](http://staffwww.dcs.shef.ac.uk/people/C.Scarton/), University of Sheffield 184 | - [Micah Shlain](https://www.linkedin.com/in/micah-shlain-52254323), Allen Institute for Artificial Intelligence 185 | - [Rico Sennrich](http://homepages.inf.ed.ac.uk/rsennric/), University of Edinburgh 186 | - [Dan Simonson](http://thedansimonson.com), Georgetown University 187 | - [Steve Sloto](https://www.linkedin.com/in/ssloto/), Amazon Web Services 188 | - [Karin Sim Smith](), University of Sheffield 189 | - [Ian Soboroff](https://www.nist.gov/people/ian-soboroff), NIST 190 | - Shilpa Suresh, Independent Researcher 191 | - [Tilahun Abedissa Taffa](), Kotebe Metropolitan University 192 | - [Tommaso Teofili](http://people.apache.org/~tommaso/), Adobe 193 | - [Emiel van Miltenburg](http://www.emielvanmiltenburg.nl), Vrije Universiteit Amsterdam 194 | - [Svitlana Vakulenko](svakulenk0.github.io), Vienna University of Economics and Business 195 | - [Gaël Varoquaux](http://gael-varoquaux.info), INRIA 196 | - [Moshe Wasserblat](), Intel Corporation 197 | - [Marcos Zampieri](http://uni-koeln.de/~mzampie2/index.html), University of Wolverhampton 198 | - [Maarten van Gompel](https://proycon.anaproy.nl/), Radboud University 199 | 200 | 201 | 258 | 259 | 265 | 266 | -------------------------------------------------------------------------------- /nlposs-2020/Speaker-Invite-1.md: -------------------------------------------------------------------------------- 1 | Dear Spencer, 2 | 3 | We are excited to invite you as our guest speaker for the 2nd Workshop for NLP Open Source Software (NLP-OSS) in year 2020. 4 | 5 | We get to know about [Compose](https://github.com/spencermountain/compromise) when looking for JavaScript solutions for NLP and we have found the library to be interesting and under-represented by the Association of Computational Linguistic (ACL) community. Your experience with Compose is very valuable to the workshop that could be shared with the ACL community at the NLP-OSS workshop. 6 | 7 | We would like to invite you to give a talk on any aspect(s) of Open Source Software and the Compose Library or any other NLP OSS that you would like to share about. 8 | 9 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. 10 | 11 | More details about the first edition of the workshop workshop can be found on https://nlposs.github.io/ 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | ACL | Seattle, Washington | 9-10 July 2020| 18 | | COLING | Barcelona, Spain | 13-14 September 2020 | 19 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 20 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 21 | 22 | We understand that confirming your participation across that many dates might be too overwhelming. But by end October, we will get the proposal confirmation from ACL and we can confirm the exact dates for our workshop (it just be one of four dates listed above). But we would need your agreement to be one of invited speakers before 22 September when submitting our proposal to the ACL committee. 23 | 24 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2020! 25 | 26 | Thank you in advance for your response. 27 | 28 | Best Regards, 29 | NLP-OSS Organizers 30 | 31 | 32 | 33 | -------------------------------------------------------------------------------- /nlposs-2020/Speaker-Invite-2.md: -------------------------------------------------------------------------------- 1 | Dear Thomas, 2 | 3 | We are excited to invite you as our guest speaker for the 2nd Workshop for NLP Open Source Software (NLP-OSS) in year 2020. 4 | 5 | The PyTorch Transformer tool has been one of the most influential NLP OSS in the previous years. The initial idea of putting all BERT-like models in once place is so good, we wished we had also thought about that earlier! Your experience in putting together so many different models into a single library and how to design the library's interface in a generic way to accomodate the different nuances of the models would be very valuable to the workshop and provides the first-hand experience at creating and maintaining NLP Open Source Software. 6 | 7 | We would like to invite you to give a talk on any aspect(s) of Open Source Software and PyTorch Transformer or any other NLP OSS that you would like to share about. 8 | 9 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprise of experts like yourself with notable OSS contribution to ML/NLP projects. 10 | 11 | More details about the first edition of the workshop can be found on https://nlposs.github.io/ 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with one of: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | ACL | Seattle, Washington | 9-10 July 2020| 18 | | COLING | Barcelona, Spain | 13-14 September 2020 | 19 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 20 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 21 | 22 | We understand that confirming your participation across that many dates might be too overwhelming. But by end October, we will get the proposal confirmation from ACL and we can confirm the exact dates for our workshop (it just be one of four dates listed above). But we would need your agreement to be one of invited speakers before 22 September when submitting our proposal to the ACL committee. 23 | 24 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2020! 25 | 26 | Thank you in advance for your response. 27 | 28 | Best Regards, 29 | NLP-OSS Organizers 30 | -------------------------------------------------------------------------------- /nlposs-2020/Speaker-Invite-3.md: -------------------------------------------------------------------------------- 1 | Dear Chip, 2 | 3 | We are excited to invite you as our guest speaker for the 2nd Workshop for NLP Open Source Software (NLP-OSS) in year 2020. 4 | 5 | We get to know about you from your interesting blog posts and the various open source projects that you have built. Your passion in open sourcing various small projects has been inspiring and we think the ACL community would benefit if you would be willing to share at the workshop what is open source to you and why you have open source various fun NLP related projects. 6 | 7 | We would like to invite you to give a talk on any aspect(s) of Open Source Software and SOTAWHAT, lazyNLP, OpenSeq2Seq or any other NLP OSS that you would like to share about. 8 | 9 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. 10 | 11 | More details about the first edition of the workshop workshop can be found on https://nlposs.github.io/ 12 | 13 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 14 | 15 | | Conference | Location | Time | 16 | |:-|:-|:-| 17 | | ACL | Seattle, Washington | 9-10 July 2020| 18 | | COLING | Barcelona, Spain | 13-14 September 2020 | 19 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 20 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 21 | 22 | We understand that confirming your participation across that many dates might be too overwhelming. But by end October, we will get the proposal confirmation from ACL and we can confirm the exact dates for our workshop (it just be one of four dates listed above). But we would need your agreement to be one of invited speakers before 22 September when submitting our proposal to the ACL committee. 23 | 24 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2020! 25 | 26 | Thank you in advance for your response. 27 | 28 | Best Regards, 29 | NLP-OSS Organizers 30 | -------------------------------------------------------------------------------- /nlposs-2020/Speaker-Invite-4-5.md: -------------------------------------------------------------------------------- 1 | Dear Matt and Ines, 2 | 3 | We are excited to invite you as our guest speaker for the 2nd Workshop for NLP Open Source Software (NLP-OSS) in year 2020. 4 | 5 | We are sorry to have missed you in the previous edition of the NLP-OSS workshop, we have been fans of your work and definitely the greater ACL community have benefited from the SpaCy one way or another. We would like to invite you to give a talk on any aspect(s) of Open Source Software and SpaCy or any other NLP OSS that you would like to share about. 6 | 7 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprise of experts like yourself with notable OSS contribution to ML/NLP projects. 8 | 9 | More details about the first edition of the workshop can be found on https://nlposs.github.io/ 10 | 11 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 12 | 13 | | Conference | Location | Time | 14 | |:-|:-|:-| 15 | | ACL | Seattle, Washington | 9-10 July 2020| 16 | | COLING | Barcelona, Spain | 13-14 September 2020 | 17 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 18 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 19 | 20 | We understand that confirming your participation across that many dates might be too overwhelming. But by end October, we will get the proposal confirmation from ACL and we can confirm the exact dates for our workshop (it just be one of four dates listed above). But we would need your agreement to be one of invited speakers before 22 September when submitting our proposal to the ACL committee. 21 | 22 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2020! 23 | 24 | Thank you in advance for your response. 25 | 26 | Best Regards, 27 | NLP-OSS Organizers 28 | -------------------------------------------------------------------------------- /nlposs-2020/Speaker-Invite-6-7-8.md: -------------------------------------------------------------------------------- 1 | Dear Matt, Mark and Joel, 2 | 3 | We are excited to invite you as our guest speaker for the 2nd Workshop for NLP Open Source Software (NLP-OSS) in year 2020. 4 | 5 | We get to know about you all from your work on AllenNLP and the awesome tutorial from last year EMNLP. Your experience with AllenNLP and reaching out to the community would be very valuable to the NLP-OSS workshop attendees. We would like to invite you to give a talk on any aspect(s) of Open Source Software and AllenNLP or any other NLP OSS that you would like to share about. 6 | 7 | The NLP-OSS workshop is unlike the normal scientific track of academic workshop, our primary objective of this workshop is to further the sharing of insights on the engineering and community aspects of creating, developing and maintaining NLP open source software (OSS) which we seldom talk about in scientific publications. The secondary goal is to promote synergies between different open source projects and encourage cross-software comparisons. Our organizers and committee comprises of experts like yourself with notable OSS contribution to ML/NLP projects. 8 | 9 | More details about the first edition of the workshop workshop can be found on https://nlposs.github.io/ 10 | 11 | Our proposal would be submitted to one of the four conferences and the workshop would be co-located with either: 12 | 13 | | Conference | Location | Time | 14 | |:-|:-|:-| 15 | | ACL | Seattle, Washington | 9-10 July 2020| 16 | | COLING | Barcelona, Spain | 13-14 September 2020 | 17 | | EMNLP | Barceló Bávaro Convention
Centre, Dominican Republic | 11-12 November 2020 | 18 | | AACL/IJCNLP 2020 | Suzhou, China | 4-7 December 2020 | 19 | 20 | We understand that confirming your participation across that many dates might be too overwhelming. But by end October, we will get the proposal confirmation from ACL and we can confirm the exact dates for our workshop (it just be one of four dates listed above). But we would need your agreement to be one of invited speakers before 22 September when submitting our proposal to the ACL committee. 21 | 22 | We look forward to your response and hopefully seeing you speak at NLP-OSS 2020! 23 | 24 | Thank you in advance for your response. 25 | 26 | Best Regards, 27 | NLP-OSS Organizers 28 | -------------------------------------------------------------------------------- /nlposs-2023-admin-1/README.md: -------------------------------------------------------------------------------- 1 | # Files 2 | 3 | Files for organizers and EMNLP workshop chairs 4 | 5 | - Workshop Program / Schedule 6 | - program_workshop_nlposs.yml 7 | - papers_workshop_nlposs.yml 8 | -------------------------------------------------------------------------------- /nlposs-2023-admin-1/papers_workshop_nlposs.yml: -------------------------------------------------------------------------------- 1 | - id: 4 2 | authors: 3 | - first_name: Lester James V. 4 | last_name: Miranda 5 | institution: Online University (can leave it as it will not shown) 6 | attributes: 7 | paper_type: long (can leave it unchanged) 8 | presentation_type: poster (can leave it unchanged) 9 | title: calamanCy: A Tagalog Natural Language Processing Toolkit 10 | abstract: Abstract. (not necessary) 11 | 12 | - id: 5 13 | authors: 14 | - first_name: Michael 15 | last_name: Günther 16 | institution: Online University (can leave it as it will not shown) 17 | 18 | - first_name: Louis 19 | last_name: Milliken 20 | institution: Online University (can leave it as it will not shown) 21 | 22 | - first_name: Jonathan 23 | last_name: Geuter 24 | institution: Online University (can leave it as it will not shown) 25 | 26 | - first_name: Georgios 27 | last_name: Mastrapas 28 | institution: Online University (can leave it as it will not shown) 29 | 30 | - first_name: Bo 31 | last_name: Wang 32 | institution: Online University (can leave it as it will not shown) 33 | 34 | - first_name: Han 35 | last_name: Xiao 36 | institution: Online University (can leave it as it will not shown) 37 | attributes: 38 | paper_type: long (can leave it unchanged) 39 | presentation_type: poster (can leave it unchanged) 40 | title: Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models 41 | abstract: Abstract. (not necessary) 42 | 43 | - id: 6 44 | authors: 45 | - first_name: David 46 | last_name: Beauchemin 47 | institution: Online University (can leave it as it will not shown) 48 | 49 | - first_name: Marouane 50 | last_name: Yassine 51 | institution: Online University (can leave it as it will not shown) 52 | attributes: 53 | paper_type: long (can leave it unchanged) 54 | presentation_type: poster (can leave it unchanged) 55 | title: Deepparse : A State-Of-The-Art Library for Parsing Multinational Street Addresses 56 | abstract: Abstract. (not necessary) 57 | 58 | - id: 8 59 | authors: 60 | - first_name: Wannaphong 61 | last_name: Phatthiyaphaibun 62 | institution: Online University (can leave it as it will not shown) 63 | 64 | - first_name: Korakot 65 | last_name: Chaovavanich 66 | institution: Online University (can leave it as it will not shown) 67 | 68 | - first_name: Charin 69 | last_name: Polpanumas 70 | institution: Online University (can leave it as it will not shown) 71 | 72 | - first_name: Arthit 73 | last_name: Suriyawongkul 74 | institution: Online University (can leave it as it will not shown) 75 | 76 | - first_name: Lalita 77 | last_name: Lowphansirikul 78 | institution: Online University (can leave it as it will not shown) 79 | 80 | - first_name: Pattarawat 81 | last_name: Chormai 82 | institution: Online University (can leave it as it will not shown) 83 | 84 | - first_name: Peerat 85 | last_name: Limkonchotiwat 86 | institution: Online University (can leave it as it will not shown) 87 | 88 | - first_name: Thanathip 89 | last_name: Suntorntip 90 | institution: Online University (can leave it as it will not shown) 91 | 92 | - first_name: Can 93 | last_name: Udomcharoenchaikit 94 | institution: Online University (can leave it as it will not shown) 95 | attributes: 96 | paper_type: long (can leave it unchanged) 97 | presentation_type: poster (can leave it unchanged) 98 | title: PyThaiNLP: Thai Natural Language Processing in Python 99 | abstract: Abstract. (not necessary) 100 | 101 | - id: 9 102 | authors: 103 | - first_name: Petros 104 | last_name: Stavropoulos 105 | institution: Online University (can leave it as it will not shown) 106 | 107 | - first_name: Ioannis 108 | last_name: Lyris 109 | institution: Online University (can leave it as it will not shown) 110 | 111 | - first_name: Natalia 112 | last_name: Manola 113 | institution: Online University (can leave it as it will not shown) 114 | 115 | - first_name: Ioanna 116 | last_name: Grypari 117 | institution: Online University (can leave it as it will not shown) 118 | 119 | - first_name: Haris 120 | last_name: Papageorgiou 121 | institution: Online University (can leave it as it will not shown) 122 | attributes: 123 | paper_type: long (can leave it unchanged) 124 | presentation_type: poster (can leave it unchanged) 125 | title: Empowering Knowledge Discovery from Scientific Literature: A novel approach to Research Artifact Analysis 126 | abstract: Abstract. (not necessary) 127 | 128 | - id: 10 129 | authors: 130 | - first_name: Loïc 131 | last_name: Grobol 132 | institution: Online University (can leave it as it will not shown) 133 | attributes: 134 | paper_type: long (can leave it unchanged) 135 | presentation_type: poster (can leave it unchanged) 136 | title: Zelda Rose: a tool for hassle-free training of transformer models 137 | abstract: Abstract. (not necessary) 138 | 139 | - id: 11 140 | authors: 141 | - first_name: Yuvanesh 142 | last_name: Anand 143 | institution: Online University (can leave it as it will not shown) 144 | 145 | - first_name: Zach 146 | last_name: Nussbaum 147 | institution: Online University (can leave it as it will not shown) 148 | 149 | - first_name: Adam 150 | last_name: Treat 151 | institution: Online University (can leave it as it will not shown) 152 | 153 | - first_name: Aaron 154 | last_name: Miller 155 | institution: Online University (can leave it as it will not shown) 156 | 157 | - first_name: Richard 158 | last_name: Guo 159 | institution: Online University (can leave it as it will not shown) 160 | 161 | - first_name: Benjamin M 162 | last_name: Schmidt 163 | institution: Online University (can leave it as it will not shown) 164 | 165 | - first_name: Brandon 166 | last_name: Duderstadt 167 | institution: Online University (can leave it as it will not shown) 168 | 169 | - first_name: Andriy 170 | last_name: Mulyar 171 | institution: Online University (can leave it as it will not shown) 172 | attributes: 173 | paper_type: long (can leave it unchanged) 174 | presentation_type: poster (can leave it unchanged) 175 | title: GPT4All: An Ecosystem of Open Source Compressed Language Models 176 | abstract: Abstract. (not necessary) 177 | 178 | - id: 12 179 | authors: 180 | - first_name: Andrew 181 | last_name: Zhu 182 | institution: Online University (can leave it as it will not shown) 183 | 184 | - first_name: Liam 185 | last_name: Dugan 186 | institution: Online University (can leave it as it will not shown) 187 | 188 | - first_name: Alyssa 189 | last_name: Hwang 190 | institution: Online University (can leave it as it will not shown) 191 | 192 | - first_name: Chris 193 | last_name: Callison-Burch 194 | institution: Online University (can leave it as it will not shown) 195 | attributes: 196 | paper_type: long (can leave it unchanged) 197 | presentation_type: poster (can leave it unchanged) 198 | title: Kani: A Lightweight and Highly Hackable Framework for Language Model Applications 199 | abstract: Abstract. (not necessary) 200 | 201 | - id: 13 202 | authors: 203 | - first_name: Sanjna 204 | last_name: Kashyap 205 | institution: Online University (can leave it as it will not shown) 206 | 207 | - first_name: Zhaoyang 208 | last_name: Xie 209 | institution: Online University (can leave it as it will not shown) 210 | 211 | - first_name: Kenneth 212 | last_name: Steimel 213 | institution: Online University (can leave it as it will not shown) 214 | 215 | - first_name: Nitin 216 | last_name: Madnani 217 | institution: Online University (can leave it as it will not shown) 218 | attributes: 219 | paper_type: long (can leave it unchanged) 220 | presentation_type: poster (can leave it unchanged) 221 | title: Beyond the Repo: A Case Study on Open Source Integration with GECToR 222 | abstract: Abstract. (not necessary) 223 | 224 | - id: 16 225 | authors: 226 | - first_name: Lefteris 227 | last_name: Loukas 228 | institution: Online University (can leave it as it will not shown) 229 | 230 | - first_name: Manos 231 | last_name: Fergadiotis 232 | institution: Online University (can leave it as it will not shown) 233 | 234 | - first_name: Prodromos 235 | last_name: Malakasiotis 236 | institution: Online University (can leave it as it will not shown) 237 | attributes: 238 | paper_type: long (can leave it unchanged) 239 | presentation_type: poster (can leave it unchanged) 240 | title: EDGAR-CRAWLER: Finding Needles in the Haystack of Financial Documents 241 | abstract: Abstract. (not necessary) 242 | 243 | - id: 17 244 | authors: 245 | - first_name: Marcel 246 | last_name: Bollmann 247 | institution: Online University (can leave it as it will not shown) 248 | 249 | - first_name: Nathan 250 | last_name: Schneider 251 | institution: Online University (can leave it as it will not shown) 252 | 253 | - first_name: Arne 254 | last_name: Köhn 255 | institution: Online University (can leave it as it will not shown) 256 | 257 | - first_name: Matt 258 | last_name: Post 259 | institution: Online University (can leave it as it will not shown) 260 | attributes: 261 | paper_type: long (can leave it unchanged) 262 | presentation_type: poster (can leave it unchanged) 263 | title: Two Decades of the ACL Anthology: Development, Impact, and Open Challenges 264 | abstract: Abstract. (not necessary) 265 | 266 | - id: 18 267 | authors: 268 | - first_name: Piotr 269 | last_name: Nawrot 270 | institution: Online University (can leave it as it will not shown) 271 | attributes: 272 | paper_type: long (can leave it unchanged) 273 | presentation_type: poster (can leave it unchanged) 274 | title: nanoT5: Fast & Simple Pre-training and Fine-tuning of T5 Models with Limited Resources 275 | abstract: Abstract. (not necessary) 276 | 277 | - id: 19 278 | authors: 279 | - first_name: Salvatore 280 | last_name: Giorgi 281 | institution: Online University (can leave it as it will not shown) 282 | 283 | - first_name: Garrick 284 | last_name: Sherman 285 | institution: Online University (can leave it as it will not shown) 286 | 287 | - first_name: Douglas 288 | last_name: Bellew 289 | institution: Online University (can leave it as it will not shown) 290 | 291 | - first_name: Sharath Chandra 292 | last_name: Guntuku 293 | institution: Online University (can leave it as it will not shown) 294 | 295 | - first_name: Lyle 296 | last_name: Ungar 297 | institution: Online University (can leave it as it will not shown) 298 | 299 | - first_name: Brenda 300 | last_name: Curtis 301 | institution: Online University (can leave it as it will not shown) 302 | attributes: 303 | paper_type: long (can leave it unchanged) 304 | presentation_type: poster (can leave it unchanged) 305 | title: AWARE-TEXT: An Android Package for Mobile Phone Based Text Collection and On-Device Processing 306 | abstract: Abstract. (not necessary) 307 | 308 | - id: 20 309 | authors: 310 | - first_name: Matt 311 | last_name: Post 312 | institution: Online University (can leave it as it will not shown) 313 | 314 | - first_name: Thamme 315 | last_name: Gowda 316 | institution: Online University (can leave it as it will not shown) 317 | 318 | - first_name: Roman 319 | last_name: Grundkiewicz 320 | institution: Online University (can leave it as it will not shown) 321 | 322 | - first_name: Huda 323 | last_name: Khayrallah 324 | institution: Online University (can leave it as it will not shown) 325 | 326 | - first_name: Rohit 327 | last_name: Jain 328 | institution: Online University (can leave it as it will not shown) 329 | 330 | - first_name: Marcin 331 | last_name: Junczys-Dowmunt 332 | institution: Online University (can leave it as it will not shown) 333 | attributes: 334 | paper_type: long (can leave it unchanged) 335 | presentation_type: poster (can leave it unchanged) 336 | title: SOTASTREAM: A Streaming Approach to Machine Translation Training 337 | abstract: Abstract. (not necessary) 338 | 339 | - id: 21 340 | authors: 341 | - first_name: Siddharth 342 | last_name: Singh 343 | institution: Online University (can leave it as it will not shown) 344 | 345 | - first_name: Shyam 346 | last_name: Ratan 347 | institution: Online University (can leave it as it will not shown) 348 | 349 | - first_name: Neerav 350 | last_name: Mathur 351 | institution: Online University (can leave it as it will not shown) 352 | 353 | - first_name: Ritesh 354 | last_name: Kumar 355 | institution: Online University (can leave it as it will not shown) 356 | attributes: 357 | paper_type: long (can leave it unchanged) 358 | presentation_type: poster (can leave it unchanged) 359 | title: An Open-source Web-based Application for Development of Resources and Technologies in Underresourced Languages 360 | abstract: Abstract. (not necessary) 361 | 362 | - id: 22 363 | authors: 364 | - first_name: Andrej 365 | last_name: Jovanovic 366 | institution: Online University (can leave it as it will not shown) 367 | 368 | - first_name: Björn 369 | last_name: Ross 370 | institution: Online University (can leave it as it will not shown) 371 | attributes: 372 | paper_type: long (can leave it unchanged) 373 | presentation_type: poster (can leave it unchanged) 374 | title: Rumour Detection in the Wild: A Browser Extension for Twitter 375 | abstract: Abstract. (not necessary) 376 | 377 | - id: 23 378 | authors: 379 | - first_name: Changxu 380 | last_name: Duan 381 | institution: Online University (can leave it as it will not shown) 382 | 383 | - first_name: Sabine 384 | last_name: Bartsch 385 | institution: Online University (can leave it as it will not shown) 386 | attributes: 387 | paper_type: long (can leave it unchanged) 388 | presentation_type: poster (can leave it unchanged) 389 | title: LaTeX Rainbow: Open Source Document Layout Semantic Annotation Framework from LaTeX to PDF 390 | abstract: Abstract. (not necessary) 391 | 392 | - id: 24 393 | authors: 394 | - first_name: Paul 395 | last_name: Landes 396 | institution: Online University (can leave it as it will not shown) 397 | 398 | - first_name: Barbara Di 399 | last_name: Eugenio 400 | institution: Online University (can leave it as it will not shown) 401 | 402 | - first_name: Cornelia 403 | last_name: Caragea 404 | institution: Online University (can leave it as it will not shown) 405 | attributes: 406 | paper_type: long (can leave it unchanged) 407 | presentation_type: poster (can leave it unchanged) 408 | title: DeepZensols: A Deep Learning Natural Language Processing Framework for Experimentation and Reproducibility 409 | abstract: Abstract. (not necessary) 410 | 411 | - id: 25 412 | authors: 413 | - first_name: Constantine 414 | last_name: Lignos 415 | institution: Online University (can leave it as it will not shown) 416 | 417 | - first_name: Maya 418 | last_name: Kruse 419 | institution: Online University (can leave it as it will not shown) 420 | 421 | - first_name: Andrew 422 | last_name: Rueda 423 | institution: Online University (can leave it as it will not shown) 424 | attributes: 425 | paper_type: long (can leave it unchanged) 426 | presentation_type: poster (can leave it unchanged) 427 | title: Improving NER Research Workflows with SeqScore 428 | abstract: Abstract. (not necessary) 429 | 430 | - id: 26 431 | authors: 432 | - first_name: Yoshitomo 433 | last_name: Matsubara 434 | institution: Online University (can leave it as it will not shown) 435 | attributes: 436 | paper_type: long (can leave it unchanged) 437 | presentation_type: poster (can leave it unchanged) 438 | title: torchdistill Meets Hugging Face Libraries for Reproducible, Coding-free Deep Learning Studies: A Case Study on NLP 439 | abstract: Abstract. (not necessary) 440 | 441 | - id: 27 442 | authors: 443 | - first_name: Vivek 444 | last_name: Miglani 445 | institution: Online University (can leave it as it will not shown) 446 | 447 | - first_name: Aobo 448 | last_name: Yang 449 | institution: Online University (can leave it as it will not shown) 450 | 451 | - first_name: Aram H. 452 | last_name: Markosyan 453 | institution: Online University (can leave it as it will not shown) 454 | 455 | - first_name: Diego 456 | last_name: Garcia-Olano 457 | institution: Online University (can leave it as it will not shown) 458 | 459 | - first_name: Narine 460 | last_name: Kokhlikyan 461 | institution: Online University (can leave it as it will not shown) 462 | attributes: 463 | paper_type: long (can leave it unchanged) 464 | presentation_type: poster (can leave it unchanged) 465 | title: Using Captum to Explain Generative Language Models 466 | abstract: Abstract. (not necessary) 467 | 468 | - id: 28 469 | authors: 470 | - first_name: Felix 471 | last_name: Stollenwerk 472 | institution: Online University (can leave it as it will not shown) 473 | attributes: 474 | paper_type: long (can leave it unchanged) 475 | presentation_type: poster (can leave it unchanged) 476 | title: nerblackbox: A High-level Library for Named Entity Recognition in Python 477 | abstract: Abstract. (not necessary) 478 | 479 | - id: 29 480 | authors: 481 | - first_name: Chris 482 | last_name: Hokamp 483 | institution: Online University (can leave it as it will not shown) 484 | 485 | - first_name: Demian Gholipour 486 | last_name: Ghalandari 487 | institution: Online University (can leave it as it will not shown) 488 | 489 | - first_name: Parsa 490 | last_name: Ghaffari 491 | institution: Online University (can leave it as it will not shown) 492 | attributes: 493 | paper_type: long (can leave it unchanged) 494 | presentation_type: poster (can leave it unchanged) 495 | title: News Signals: An NLP Library for Text and Time Series 496 | abstract: Abstract. (not necessary) 497 | 498 | - id: 30 499 | authors: 500 | - first_name: Shubhanshu 501 | last_name: Mishra 502 | institution: Online University (can leave it as it will not shown) 503 | 504 | - first_name: Jana 505 | last_name: Diesner 506 | institution: Online University (can leave it as it will not shown) 507 | attributes: 508 | paper_type: long (can leave it unchanged) 509 | presentation_type: poster (can leave it unchanged) 510 | title: PyTAIL: An Open Source Tool for Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data 511 | abstract: Abstract. (not necessary) 512 | 513 | - id: 32 514 | authors: 515 | - first_name: Hrishikesh 516 | last_name: Terdalkar 517 | institution: Online University (can leave it as it will not shown) 518 | 519 | - first_name: Arnab 520 | last_name: Bhattacharya 521 | institution: Online University (can leave it as it will not shown) 522 | attributes: 523 | paper_type: long (can leave it unchanged) 524 | presentation_type: poster (can leave it unchanged) 525 | title: Antarlekhaka: A Comprehensive Tool for Multi-task Natural Language Annotation 526 | abstract: Abstract. (not necessary) 527 | 528 | - id: 33 529 | authors: 530 | - first_name: Bang 531 | last_name: Fu 532 | institution: Online University (can leave it as it will not shown) 533 | 534 | - first_name: Di 535 | last_name: Feng 536 | institution: Online University (can leave it as it will not shown) 537 | attributes: 538 | paper_type: long (can leave it unchanged) 539 | presentation_type: poster (can leave it unchanged) 540 | title: GPTCache: An Open-Source Semantic Cache for LLM Applications Enabling Faster Answers and Cost Savings 541 | abstract: Abstract. (not necessary) 542 | 543 | - id: 94 544 | authors: 545 | - first_name: Dung Nguyen 546 | last_name: Manh 547 | institution: Online University (can leave it as it will not shown) 548 | 549 | - first_name: Nam Le 550 | last_name: Hai 551 | institution: Online University (can leave it as it will not shown) 552 | 553 | - first_name: Anh T. V. 554 | last_name: Dau 555 | institution: Online University (can leave it as it will not shown) 556 | 557 | - first_name: Anh Minh 558 | last_name: Nguyen 559 | institution: Online University (can leave it as it will not shown) 560 | 561 | - first_name: Khanh 562 | last_name: Nghiem 563 | institution: Online University (can leave it as it will not shown) 564 | 565 | - first_name: Jin 566 | last_name: Guo 567 | institution: Online University (can leave it as it will not shown) 568 | 569 | - first_name: Nghi D. Q. 570 | last_name: Bui 571 | institution: Online University (can leave it as it will not shown) 572 | attributes: 573 | paper_type: long (can leave it unchanged) 574 | presentation_type: poster (can leave it unchanged) 575 | title: The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation 576 | abstract: Abstract. (not necessary) 577 | 578 | - id: 99 579 | authors: 580 | - first_name: William 581 | last_name: Tjhi 582 | institution: Online University (can leave it as it will not shown) 583 | 584 | - first_name: David 585 | last_name: Ong 586 | institution: Online University (can leave it as it will not shown) 587 | 588 | - first_name: Peerat 589 | last_name: Limkonchotiwat 590 | institution: Online University (can leave it as it will not shown) 591 | attributes: 592 | paper_type: long (can leave it unchanged) 593 | presentation_type: poster (can leave it unchanged) 594 | title: SEA-LION (Southeast Asian Languages In One Network): A Family of Southeast Asian Language Models 595 | abstract: Abstract. (not necessary) 596 | 597 | - id: 97 598 | authors: 599 | - first_name: Louis 600 | last_name: Castricato 601 | institution: Online University (can leave it as it will not shown) 602 | attributes: 603 | paper_type: long (can leave it unchanged) 604 | presentation_type: poster (can leave it unchanged) 605 | title: trlX: A Framework for Large Scale Open Source RLHF 606 | abstract: Abstract. (not necessary) 607 | 608 | - id: 95 609 | authors: 610 | - first_name: Brandon 611 | last_name: Duderstadt 612 | institution: Online University (can leave it as it will not shown) 613 | 614 | - first_name: Yuvanesh 615 | last_name: Anand 616 | institution: Online University (can leave it as it will not shown) 617 | attributes: 618 | paper_type: long (can leave it unchanged) 619 | presentation_type: poster (can leave it unchanged) 620 | title: Towards Explainable and Accessible AI 621 | abstract: Abstract. (not necessary) -------------------------------------------------------------------------------- /nlposs-2023-admin-1/program_workshop_nlposs.yml: -------------------------------------------------------------------------------- 1 | - title: Opening Remarks 2 | start_time: 2023-12-06 09:00:00 3 | end_time: 2023-12-06 09:15:00 4 | 5 | - title: "Invited Talk 1" 6 | start_time: 2023-12-06 09:15:00 7 | end_time: 2023-12-06 10:15:00 8 | location: Hybrid 9 | papers: 10 | - id: 99 11 | start_time: 2023-12-06 09:15:00 12 | end_time: 2023-12-06 10:15:00 13 | 14 | - title: Coffee Break 15 | start_time: 2023-12-06 10:30:00 16 | end_time: 2023-12-06 11:00:00 17 | 18 | - title: "Lightning Session 1" 19 | start_time: 2023-12-06 11:00:00 20 | end_time: 2023-12-06 11:30:00 21 | location: Hybrid 22 | 23 | - title: "Poster Session 1" 24 | start_time: 2023-12-06 11:30:00 25 | end_time: 2023-12-06 12:15:00 26 | location: Hybrid 27 | papers: 28 | - id: 5 29 | - id: 6 30 | - id: 8 31 | - id: 10 32 | - id: 12 33 | - id: 13 34 | - id: 16 35 | - id: 17 36 | - id: 18 37 | - id: 19 38 | - id: 20 39 | - id: 21 40 | - id: 22 41 | start_time: 2023-12-06 11:30:00 42 | end_time: 2023-12-06 12:15:00 43 | 44 | - title: Lunch Break 45 | start_time: 2023-12-06 12:15:00 46 | end_time: 2023-12-06 13:45:00 47 | 48 | - title: "Invited Talk 2" 49 | start_time: 2023-12-06 13:45:00 50 | end_time: 2023-12-06 14:45:00 51 | location: Hybrid 52 | papers: 53 | - id: 97 54 | start_time: 2023-12-06 13:45:00 55 | end_time: 2023-12-06 14:45:00 56 | 57 | - title: "Lightning Session 2" 58 | start_time: 2023-12-06 14:45:00 59 | end_time: 2023-12-06 15:15:00 60 | location: Hybrid 61 | 62 | - title: Coffee Break 63 | start_time: 2023-12-06 15:15:00 64 | end_time: 2023-12-06 15:30:00 65 | 66 | - title: "Poster Session 2" 67 | start_time: 2023-12-06 15:30:00 68 | end_time: 2023-12-06 16:15:00 69 | location: Hybrid 70 | papers: 71 | - id: 11 72 | - id: 23 73 | - id: 24 74 | - id: 25 75 | - id: 26 76 | - id: 27 77 | - id: 28 78 | - id: 29 79 | - id: 30 80 | - id: 33 81 | - id: 94 82 | start_time: 2023-12-06 15:30:00 83 | end_time: 2023-12-06 16:15:00 84 | 85 | 86 | - title: "Invited Talk 3" 87 | start_time: 2023-12-06 16:15:00 88 | end_time: 2023-12-06 17:15:00 89 | location: Hybrid 90 | papers: 91 | - id: 95 92 | start_time: 2023-12-06 16:15:00 93 | end_time: 2023-12-06 17:15:00 94 | 95 | - title: Closing Remarks 96 | start_time: 2023-12-06 17:15:00 97 | end_time: 2023-12-06 17:30:00 98 | -------------------------------------------------------------------------------- /nlposs-2023/04-calamanCy-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/04-calamanCy-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/04-calamanCy-Presentation.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/04-calamanCy-Presentation.pdf -------------------------------------------------------------------------------- /nlposs-2023/05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/06-Deepparse EMNLP.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/06-Deepparse EMNLP.pdf -------------------------------------------------------------------------------- /nlposs-2023/06-Deepparse___An_Extendable__and_Fine_Tunable_State_Of_The_Art_Library_for_Parsing_Multinational_Street_Addresses_Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/06-Deepparse___An_Extendable__and_Fine_Tunable_State_Of_The_Art_Library_for_Parsing_Multinational_Street_Addresses_Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/08-PyThaiNLP-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/08-PyThaiNLP-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/08-PyThaiNLP-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/08-PyThaiNLP-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/09-Empowering_KD_from_SL_RAA_Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/09-Empowering_KD_from_SL_RAA_Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/09-Empowering_KD_from_SL_RAA_lightning_slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/09-Empowering_KD_from_SL_RAA_lightning_slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/10-lgrobol-zeldarose.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/10-lgrobol-zeldarose.pdf -------------------------------------------------------------------------------- /nlposs-2023/11-gpt4all_lightning_slide.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/11-gpt4all_lightning_slide.png -------------------------------------------------------------------------------- /nlposs-2023/11-gpt4all_poster.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/11-gpt4all_poster.png -------------------------------------------------------------------------------- /nlposs-2023/12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/13_gector_case_study_poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/13_gector_case_study_poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/13_gector_case_study_ppt.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/13_gector_case_study_ppt.pdf -------------------------------------------------------------------------------- /nlposs-2023/16-EDGAR_CRAWLER_poster.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/16-EDGAR_CRAWLER_poster.png -------------------------------------------------------------------------------- /nlposs-2023/16-EDGAR_CRAWLER_slide_lightning_presentation.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/16-EDGAR_CRAWLER_slide_lightning_presentation.pdf -------------------------------------------------------------------------------- /nlposs-2023/17-ACL-Anthology-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/17-ACL-Anthology-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/17-ACL-Anthology-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/17-ACL-Anthology-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/18-nanoT5-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/18-nanoT5-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/18-nanoT5-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/18-nanoT5-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/19_AWAREText_lightning_talk_slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/19_AWAREText_lightning_talk_slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/19_AWAREText_poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/19_AWAREText_poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/20-sotastream-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/20-sotastream-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/20-sotastream-slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/20-sotastream-slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/21_LiFE_Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/21_LiFE_Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/21_LiFE_Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/21_LiFE_Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/22-rumour-detection-in-the-wild-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/22-rumour-detection-in-the-wild-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/22-rumour-detection-in-the-wild-slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/22-rumour-detection-in-the-wild-slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/23-LaTeX-Rainbow-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/23-LaTeX-Rainbow-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/23-LaTeX-Rainbow-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/23-LaTeX-Rainbow-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/24-LP-DeepZensolsLightingPres.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/24-LP-DeepZensolsLightingPres.pdf -------------------------------------------------------------------------------- /nlposs-2023/24-Poster-DeepZensolsPoster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/24-Poster-DeepZensolsPoster.pdf -------------------------------------------------------------------------------- /nlposs-2023/25-SeqScore-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/25-SeqScore-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/25-SeqScore-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/25-SeqScore-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/26-torchdistill_nlp-oss2023_poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/26-torchdistill_nlp-oss2023_poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/26-torchdistill_nlp-oss2023_slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/26-torchdistill_nlp-oss2023_slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/27-captum-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/27-captum-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/27-captum-slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/27-captum-slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/28-nerblackbox-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/28-nerblackbox-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/28-nerblackbox-presentation.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/28-nerblackbox-presentation.pdf -------------------------------------------------------------------------------- /nlposs-2023/29-NewsSignals-lightning-slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/29-NewsSignals-lightning-slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/29-NewsSignals-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/29-NewsSignals-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/30-PyTAIL-Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/30-PyTAIL-Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/30-PyTAIL-Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/30-PyTAIL-Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/33-GPTCache-poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/33-GPTCache-poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/33-GPTCache-slider.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/33-GPTCache-slider.pdf -------------------------------------------------------------------------------- /nlposs-2023/Check-posters.ipynb: -------------------------------------------------------------------------------- 1 | { 2 | "cells": [ 3 | { 4 | "cell_type": "code", 5 | "execution_count": 1, 6 | "id": "fe1f69d1", 7 | "metadata": {}, 8 | "outputs": [], 9 | "source": [ 10 | "papers = \"\"\"4\n", 11 | "5\n", 12 | "6\n", 13 | "8\n", 14 | "9\n", 15 | "10\n", 16 | "11\n", 17 | "12\n", 18 | "13\n", 19 | "16\n", 20 | "17\n", 21 | "18\n", 22 | "19\n", 23 | "20\n", 24 | "21\n", 25 | "22\n", 26 | "23\n", 27 | "24\n", 28 | "25\n", 29 | "26\n", 30 | "27\n", 31 | "28\n", 32 | "29\n", 33 | "30\n", 34 | "32\n", 35 | "33\"\"\".split('\\n')" 36 | ] 37 | }, 38 | { 39 | "cell_type": "code", 40 | "execution_count": 5, 41 | "id": "cd0d0673", 42 | "metadata": {}, 43 | "outputs": [], 44 | "source": [ 45 | "papers = [int(i) for i in papers]" 46 | ] 47 | }, 48 | { 49 | "cell_type": "code", 50 | "execution_count": 6, 51 | "id": "cfb46228", 52 | "metadata": {}, 53 | "outputs": [ 54 | { 55 | "name": "stdout", 56 | "output_type": "stream", 57 | "text": [ 58 | "04-calamanCy-Poster.pdf\r\n", 59 | "04-calamanCy-Presentation.pdf\r\n", 60 | "05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Poster.pdf\r\n", 61 | "05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Slide.pdf\r\n", 62 | "08-PyThaiNLP-Poster.pdf\r\n", 63 | "08-PyThaiNLP-Slide.pdf\r\n", 64 | "09-Empowering_KD_from_SL_RAA_Poster.pdf\r\n", 65 | "09-Empowering_KD_from_SL_RAA_lightning_slide.pdf\r\n", 66 | "11-gpt4all_lightning_slide.png\r\n", 67 | "11-gpt4all_poster.png\r\n", 68 | "12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-poster.pdf\r\n", 69 | "12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-slide.pdf\r\n", 70 | "13_gector_case_study_poster.pdf\r\n", 71 | "13_gector_case_study_ppt.pdf\r\n", 72 | "16-EDGAR_CRAWLER_poster.png\r\n", 73 | "16-EDGAR_CRAWLER_slide_lightning_presentation.pdf\r\n", 74 | "17-ACL-Anthology-Poster.pdf\r\n", 75 | "17-ACL-Anthology-Slide.pdf\r\n", 76 | "18-nanoT5-Poster.pdf\r\n", 77 | "18-nanoT5-Slide.pdf\r\n", 78 | "20-sotastream-poster.pdf\r\n", 79 | "20-sotastream-slide.pdf\r\n", 80 | "21_LiFE_Poster.pdf\r\n", 81 | "21_LiFE_Slide.pdf\r\n", 82 | "22-rumour-detection-in-the-wild-poster.pdf\r\n", 83 | "22-rumour-detection-in-the-wild-slide.pdf\r\n", 84 | "23-LaTeX-Rainbow-Poster.pdf\r\n", 85 | "23-LaTeX-Rainbow-Slide.pdf\r\n", 86 | "24-LP-DeepZensolsLightingPres.pdf\r\n", 87 | "24-Poster-DeepZensolsPoster.pdf\r\n", 88 | "26-torchdistill_nlp-oss2023_poster.pdf\r\n", 89 | "26-torchdistill_nlp-oss2023_slide.pdf\r\n", 90 | "27-captum-poster.pdf\r\n", 91 | "27-captum-slide.pdf\r\n", 92 | "28-nerblackbox-poster.pdf\r\n", 93 | "28-nerblackbox-presentation.pdf\r\n", 94 | "29-NewsSignals-lightning-slide.pdf\r\n", 95 | "29-NewsSignals-poster.pdf\r\n", 96 | "30-PyTAIL-Poster.pdf\r\n", 97 | "30-PyTAIL-Slide.pdf\r\n", 98 | "32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Poster.pdf\r\n", 99 | "32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Slide.pdf\r\n", 100 | "Findings-The_Vault_23_10.pdf\r\n", 101 | "Findings-The_Vault_23_10_Poster.pdf\r\n", 102 | "Findings-The_Vault_23_10_Slide.pdf\r\n", 103 | "README.md\r\n", 104 | "Untitled.ipynb\r\n" 105 | ] 106 | } 107 | ], 108 | "source": [ 109 | "ls" 110 | ] 111 | }, 112 | { 113 | "cell_type": "code", 114 | "execution_count": 7, 115 | "id": "096e1b9b", 116 | "metadata": {}, 117 | "outputs": [], 118 | "source": [ 119 | "x = \"\"\"04-calamanCy-Poster.pdf\n", 120 | "04-calamanCy-Presentation.pdf\n", 121 | "05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Poster.pdf\n", 122 | "05-Jina-Embeddings-A-Novel-Set-of-High-Performance-Sentence-Embedding-Models-Slide.pdf\n", 123 | "08-PyThaiNLP-Poster.pdf\n", 124 | "08-PyThaiNLP-Slide.pdf\n", 125 | "09-Empowering_KD_from_SL_RAA_Poster.pdf\n", 126 | "09-Empowering_KD_from_SL_RAA_lightning_slide.pdf\n", 127 | "11-gpt4all_lightning_slide.png\n", 128 | "11-gpt4all_poster.png\n", 129 | "12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-poster.pdf\n", 130 | "12-kani-a-lightweight-and-highly-hackable-framework-for-building-language-model-applications-slide.pdf\n", 131 | "13_gector_case_study_poster.pdf\n", 132 | "13_gector_case_study_ppt.pdf\n", 133 | "16-EDGAR_CRAWLER_poster.png\n", 134 | "16-EDGAR_CRAWLER_slide_lightning_presentation.pdf\n", 135 | "17-ACL-Anthology-Poster.pdf\n", 136 | "17-ACL-Anthology-Slide.pdf\n", 137 | "18-nanoT5-Poster.pdf\n", 138 | "18-nanoT5-Slide.pdf\n", 139 | "20-sotastream-poster.pdf\n", 140 | "20-sotastream-slide.pdf\n", 141 | "21_LiFE_Poster.pdf\n", 142 | "21_LiFE_Slide.pdf\n", 143 | "22-rumour-detection-in-the-wild-poster.pdf\n", 144 | "22-rumour-detection-in-the-wild-slide.pdf\n", 145 | "23-LaTeX-Rainbow-Poster.pdf\n", 146 | "23-LaTeX-Rainbow-Slide.pdf\n", 147 | "24-LP-DeepZensolsLightingPres.pdf\n", 148 | "24-Poster-DeepZensolsPoster.pdf\n", 149 | "26-torchdistill_nlp-oss2023_poster.pdf\n", 150 | "26-torchdistill_nlp-oss2023_slide.pdf\n", 151 | "27-captum-poster.pdf\n", 152 | "27-captum-slide.pdf\n", 153 | "28-nerblackbox-poster.pdf\n", 154 | "28-nerblackbox-presentation.pdf\n", 155 | "29-NewsSignals-lightning-slide.pdf\n", 156 | "29-NewsSignals-poster.pdf\n", 157 | "30-PyTAIL-Poster.pdf\n", 158 | "30-PyTAIL-Slide.pdf\n", 159 | "32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Poster.pdf\n", 160 | "32_Antarlekhaka_A_Comprehensive_Tool_for_Multi-task_Natural_Language_Annotation_Slide.pdf\"\"\".split('\\n')" 161 | ] 162 | }, 163 | { 164 | "cell_type": "code", 165 | "execution_count": 10, 166 | "id": "d7f5f0e9", 167 | "metadata": {}, 168 | "outputs": [], 169 | "source": [ 170 | "posters = set([int(i.split('-')[0].split('_')[0]) for i in x])" 171 | ] 172 | }, 173 | { 174 | "cell_type": "code", 175 | "execution_count": 11, 176 | "id": "b750a06a", 177 | "metadata": {}, 178 | "outputs": [ 179 | { 180 | "data": { 181 | "text/plain": [ 182 | "{6, 10, 19, 25, 33}" 183 | ] 184 | }, 185 | "execution_count": 11, 186 | "metadata": {}, 187 | "output_type": "execute_result" 188 | } 189 | ], 190 | "source": [ 191 | "set(papers).difference(posters)" 192 | ] 193 | }, 194 | { 195 | "cell_type": "code", 196 | "execution_count": null, 197 | "id": "cff505fa", 198 | "metadata": {}, 199 | "outputs": [], 200 | "source": [] 201 | } 202 | ], 203 | "metadata": { 204 | "kernelspec": { 205 | "display_name": "Python 3", 206 | "language": "python", 207 | "name": "python3" 208 | }, 209 | "language_info": { 210 | "codemirror_mode": { 211 | "name": "ipython", 212 | "version": 3 213 | }, 214 | "file_extension": ".py", 215 | "mimetype": "text/x-python", 216 | "name": "python", 217 | "nbconvert_exporter": "python", 218 | "pygments_lexer": "ipython3", 219 | "version": "3.8.2" 220 | } 221 | }, 222 | "nbformat": 4, 223 | "nbformat_minor": 5 224 | } 225 | -------------------------------------------------------------------------------- /nlposs-2023/Findings-The_Vault_23_10.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/Findings-The_Vault_23_10.pdf -------------------------------------------------------------------------------- /nlposs-2023/Findings-The_Vault_23_10_Poster.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/Findings-The_Vault_23_10_Poster.pdf -------------------------------------------------------------------------------- /nlposs-2023/Findings-The_Vault_23_10_Slide.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/Findings-The_Vault_23_10_Slide.pdf -------------------------------------------------------------------------------- /nlposs-2023/README.md: -------------------------------------------------------------------------------- 1 | Instruction 2 | ==== 3 | 4 | Please upload your paper's poster and 1-slide lightning presentation here! 5 | 6 | If you require help or present to send the poster and slide to us through email, please contact nlposs.workshop@gmail.com 7 | 8 | 9 | Finalized schedule on https://docs.google.com/presentation/d/13RP8y1uUGgfYAJbEo8h8waNaJ0zFpu9QhAdy3ruGuh0/edit?usp=sharing 10 | 11 | Lightning session 1: https://smallpdf.com/file#s=c2598260-dfba-4d1f-b2a6-8674711d8c9a 12 | 13 | Lightning session 2: https://smallpdf.com/file#s=0011a1cf-8d93-4f13-98cd-1ec7c638f95d 14 | -------------------------------------------------------------------------------- /nlposs-2023/lightning-session-1.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/lightning-session-1.pdf -------------------------------------------------------------------------------- /nlposs-2023/lightning-slides-2-compressed.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/lightning-slides-2-compressed.pdf -------------------------------------------------------------------------------- /nlposs-2023/lightning-slides-2.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nlposs/NLP-OSS/712933f4af9a43020eeec764fdfd551c1e9bf29c/nlposs-2023/lightning-slides-2.pdf --------------------------------------------------------------------------------