Awesome List Updates on Jan 20, 2024
7 awesome lists updated today.
1. Awesome Agi Cocosci
Domain Specific Language / Declarative DSL Applications
- Learning the language of viral evolution and escape - Science, 2021. [All Versions]. Applies natural language processing, which models both grammar (syntax) and meaning (semantics), to predict which viral mutations may lead to viral escape.
2. Awesome Rust
Applications / Blockchain
- beerus (⭐244) - Beerus is a trustless StarkNet Light Client, ⚡blazing fast ⚡
3. Awesome Testing
Software / Make your life easier
- playwright-bdd (⭐204) - A module for running Behaviour-Driven Development (BDD) tests with the Playwright runner.
4. Awesome Cpp
Regular Expression
- SRELL - Unicode-aware regular expression template library for C++. [BSD]
5. Awesome Gnome
Themes for non-GTK apps / Skeuomorphic Icons
- Thunderbird GNOME Theme (⭐277) - Integrate Thunderbird into GNOME-based desktop using Adwaita.
6. Awesome Zsh Plugins
Plugins / superconsole - Windows-only
Themes / superconsole - Windows-only
- ksposh (⭐0) - Includes decorators for python virtual environment, git information, current directory and username.
7. Awesome Ai4lam
Tools and Frameworks / Document analysis, transcription, and labeling
- Callico – open-source web platform for document annotation
- Distributed Annotation 'n' Enrichment (DANE) (⭐3) – compute task assignment & file storage for automatic annotation of content (CLARIAH, Netherlands)
- HTRFLOW demo and associated GitHub repo (⭐24) – explore AI models for Handwritten Text Recognition (Swedish National Archives)
- Label Studio – data labeling platform to fine-tune LLMs, prepare training data, or validate AI models
- OCR correction – OCR correction tools (Bibliothèque nationale, Luxembourg)
- Text models from the National Library of Sweden – available on Hugging Face
- Transkribus – transcription, recognition, & searching of historical documents
Tools and Frameworks / Audio and video analysis, transcription, and labeling
- Acoustic models from the National Library of Sweden – available on Hugging Face
- Audiovisual Metadata Platform (AMP) – generation of metadata for discovery & use of digital audio & video collections (Indiana U., USA)
- CAMPI – Computer-Aided Metadata Generation for Photo archives Initiative (Carnegie Mellon U., USA)
- inaFaceAnalyzer (⭐17) – Python toolbox for face-based description of gender representation in media (Institut National de l'Audiovisuel, France)
- Newspaper Navigator – explore visual & textual content in the Chronicling America digitized newspaper collection (Library of Congress, USA)
- Oodi – virtual information assistant (Helsinki Central Library)
- ReTV – video analysis & summarization (Modul University, Austria)
Tools and Frameworks / Indexing and classification
- Annif and associated tutorial (⭐36) – tool for automated subject indexing and classification (National Library of Finland)
Tools and Frameworks / Search and retrieval
- GallicaPix – retrieval of heritage images (Bibliothèque nationale de France)
- GallicaSNOOP – framework for large-scale content-based image retrieval (Bibliothèque nationale de France)
- Maken Similarity Service – tools for alternative reading & finding similar photographs (National Library of Norway)
- Semantic search for Nasjonalmuseet’s online collection – open beta test (National Museum of Norway)
Tools and Frameworks / Applications of Transformers, LLMs, and GPT
- BERTopic – topic modeling technique that leverages Transformers and c-TF-IDF
- Chatbot for Luxembourgish newspapers – uses ChatGPT and understands French, German and English (Bibliothèque nationale de Luxembourg)
- Norwegian Transformer Model (NoTraM) (⭐109) – transformer model for Norwegian and Nordic languages (National Library of Norway)
- Swedish BERT (⭐137) – BERT model for the Swedish language (Royal Library of Sweden)
- Visual AI – open-world interpretable visual transformer (UK)
Datasets / Datasets available elsewhere
- Gensim datasets (⭐965) – repository of datasets for unstructured text processing
- HTR-United – datasets for training transcription or segmentation models
- nlp-datasets (⭐5.7k) – free/public domain datasets with text data for use in NLP
- Open Library data dumps – from the Internet Archive
- Registry of Open Data on AWS – datasets tagged by topic
Projects, Initiatives, and Case Studies / Project lists & directories
- List of Artificial Intelligence (AI) initiatives in museums – compiled in 2021 by Elena Villaespesa, Oonagh Murphy and Kate Nadel for the Museums+AI Network project.
- Projects in AI Registry (PAIR) – registry of AI projects in higher education (U. Oklahoma Libraries, USA)
Projects, Initiatives, and Case Studies / Select individual projects
- Vatican Manuscripts – machine transcription in the Vatican Secret Archive
Policies and recommendations / Statements by organizations and government bodies
Policies and recommendations / Surveys of policies and recommendations
- A cluster analysis of national AI strategies – Brookings Institution analysis of different countries’ national AI strategies, Dec. 2023
- A principled governance for emerging AI regimes: lessons from China, the European Union, and the United States by R. B. L. Dixon in AI and Ethics, 3, 793–810, 2023
- AI Governance Alliance: Briefing Paper Series – by the World Economic Forum, Jan. 2024
- AI policies across the globe: Implications and recommendations for libraries by L. S. Lo in IFLA Journal, 49(4), 645–649, 2023
- Principled Artificial Intelligence: Mapping Consensus in Ethical and Rights-Based Approaches to Principles for AI by Fjeld et al., Berkman Klein Center Research Publication No. 2020-1, 2020