Track Awesome NLG Updates Weekly
A curated list of resources dedicated to Natural Language Generation (NLG)
accelerated-text/awesome-nlg · ⭐ 424 · 🏷️ Computer Science
Aug 28 - Sep 03, 2023
Neural Natural Language Generation
- lightnlg (⭐3) - A minimalistic codebase for finetuning and interacting with NLG models using PyTorch Lightning.
Jan 30 - Feb 05, 2023
Libraries
- Writebot - A Node.js library that makes GPT-3 easier to use through presets.
Mar 21 - Mar 27, 2022
Papers and Articles
Apr 12 - Apr 18, 2021
Datasets
- WikiBio - Wikipedia biography dataset - This dataset gathers 728,321 biographies from Wikipedia. It is intended for evaluating text generation algorithms.
Mar 22 - Mar 28, 2021
Papers and Articles
Feb 01 - Feb 07, 2021
Evaluation
- GEM - a benchmark environment for NLG with a focus on its Evaluation, both through human annotations and automated Metrics.
Jan 18 - Jan 24, 2021
Evaluation
Papers and Articles
Nov 30 - Dec 06, 2020
Neural Natural Language Generation
- PaperRobot: Incremental Draft Generation of Scientific Ideas (⭐468) - PaperRobot acts as an automatic research assistant.
- Question Generation using 🤗transformers (⭐981) - Question generation is the task of automatically generating questions from a text paragraph.
Papers and Articles
Nov 16 - Nov 22, 2020
Papers and Articles
Jun 22 - Jun 28, 2020
Papers and Articles
Jun 01 - Jun 07, 2020
Neural Natural Language Generation
- This Word Does Not Exist (⭐1k) - This project allows people to train a variant of GPT-2 that makes up words, definitions, and examples from scratch.
May 25 - May 31, 2020
Neural Natural Language Generation
- aitextgen (⭐1.8k) - A robust Python tool for text-based AI training and generation using GPT-2.
May 18 - May 24, 2020
Datasets
- The Schema-Guided Dialogue Dataset (⭐466) - The Schema-Guided Dialogue (SGD) dataset consists of over 20k annotated multi-domain, task-oriented conversations between a human and a virtual assistant.
Apr 27 - May 03, 2020
Libraries
- Number Words (⭐194) - Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'.
Apr 13 - Apr 19, 2020
Products
- RosaeNLG - An open-source library for Node.js or client-side (browser) execution, based on the Pug template engine, to generate texts in English, French, German and Italian.
Mar 09 - Mar 15, 2020
Videos
Feb 24 - Mar 01, 2020
Evaluation
- compare-mt (⭐446) - A tool for holistic analysis of language generation systems.
Jan 20 - Jan 26, 2020
Dialog
- Plato (⭐970) - This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
- RNNLG (⭐488) - RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains.
Jan 13 - Jan 19, 2020
Datasets
- Alex Context NLG Dataset (⭐23) - A dataset for NLG in dialogue systems in the public transport information domain.
- Box-score data (⭐105) - This dataset consists of (human-written) NBA basketball game summaries aligned with their corresponding box- and line-scores.
- WeatherGov - Computer-generated weather forecasts from weather.gov (US public forecast), along with corresponding weather data.
- WebNLG (⭐65) - An enriched version of WebNLG, a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation.
- The Wikipedia company corpus - Company descriptions collected from Wikipedia. The dataset contains semantic representations as well as short and long descriptions for 51K companies in English.
Neural Natural Language Generation
- Texar (⭐2.4k) - Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks.
Jan 06 - Jan 12, 2020
Datasets
- Neural-Wikipedian (⭐10) - The repository contains the code along with the required corpora that were used in order to build a system that "learns" how to generate English biographies for Semantic Web triples.
Videos
Dec 16 - Dec 22, 2019
Neural Natural Language Generation
- PPLM (⭐1.1k) - Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.
Dec 09 - Dec 15, 2019
Dialog
- Chatito (⭐844) - Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Neural Natural Language Generation
- textgenrnn (⭐4.9k) - Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
Videos
Dec 02 - Dec 08, 2019
Libraries
- Cron Expression Descriptor (⭐898) - A .NET library that converts cron expressions into human-readable descriptions.
Nov 25 - Dec 01, 2019
Realizers
- GenI (⭐22) - Surface realiser (part of a Natural Language Generation system) using Tree Adjoining Grammar.
- JSrealB (⭐20) - A JavaScript bilingual text realizer for web development.
Nov 18 - Nov 24, 2019
Neural Natural Language Generation
- graph-2-text (⭐151) - Graph-to-sequence model implemented in PyTorch, combining graph convolutional networks and OpenNMT-py.
- Transformers (⭐111k) - State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Nov 11 - Nov 17, 2019
Datasets
- E2E - This shared task focuses on recent end-to-end (E2E), data-driven NLG methods, which jointly learn sentence planning and surface realisation from non-aligned data.
- YelpNLG - YelpNLG provides resources for natural language generation of restaurant reviews.
Evaluation
- NLG-eval (⭐1.2k) - Evaluation code for various unsupervised automated metrics for Natural Language Generation.
- VizSeq (⭐434) - A Visual Analysis Toolkit for Text Generation Tasks.
Grammar
- GrammaticalFramework - A programming language for multilingual grammar applications.
- CCGweb (⭐6) - A Web platform for parsing and annotation.
Narrative Generation
- Tracery (⭐2.1k) - A story-grammar generation library for JavaScript.
Neural Natural Language Generation
- Image Caption Generator (⭐145) - A neural-network-based generative model for captioning images using TensorFlow.
- Summary Generation From Structured Data (⭐183) - For converting information present in the form of structured data into natural language text.
Products
- Accelerated Text (⭐734) - Automatically generate multiple natural language descriptions of your data varying in wording and structure.
- Twine - An open-source tool for telling interactive, nonlinear stories.
Realizers
- SimpleNLG-EnFr (⭐25) - SimpleNLG-EnFr 1.1 is a bilingual English/French adaptation of SimpleNLG v4.2.
Templating Languages
- calyx (⭐60) - A Ruby library for generating text with recursive template grammars.
- nalgene (⭐54) - Natural language generation language.
- StringTemplate - Java template engine (with ports for C#, Objective-C, JavaScript, Scala) for generating source code, web pages, emails, or any other formatted text output.
Nov 04 - Nov 10, 2019
Grammar
- OpenCCG (⭐201) - OpenCCG library for parsing and realization with CCG.
- EasyCCG (⭐58) - CCG: All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG.
- CCG Lab (⭐26) - All combinators, common grammar format, parsing to logical form, parameter estimation for probabilistic CCG.
Narrative Generation
- Random Story Generator (⭐62) - Using Natural Language Generation (NLG) to create a random short story.
Realizers
- SimpleNLG (⭐795) - Java API for Natural Language Generation.
- SimpleNLG DE (⭐16) - German version of SimpleNLG 4.
Oct 28 - Nov 03, 2019
Dialog
- NNDIAL (⭐346) - NNDial is an open source toolkit for building end-to-end trainable task-oriented dialogue models.
- TGen (⭐204) - Statistical NLG for spoken dialogue systems.