Awesome List Updates on Jan 19, 2024
8 awesome lists updated today.
1. Awesome Polars
Resources / Blog posts
- Great Tables: The Polars DataFrame Styler of Your Dreams - A post by @machow showing how the Great Tables package uses Polars expressions to make delightful tables.
2. Awesome Ai4lam
Learning Resources / Natural language processing
- NLP course and associated GitHub repo (⭐9.6k) – by Elena Voita
- NLP accelerated class – by Machine Learning University
- Deep Learning for NLP – from Machine Learning Mastery
Learning Resources / Generative AI
- A Very Gentle Introduction to LLMs without the Hype – by Mark Riedl
- A brief introduction to GenAI – by U. Michigan MIDAS
3. Awesome Php
Table of Contents / API
- PackageGenerator (⭐422) - Package Generator generates a PHP SDK from any WSDL.
4. Awesome Scriptable
Tools
- github-contributions (⭐0) - GitHub contributions heatmap on your lockscreen.
5. Awesome Web Archiving
Web Archiving Service Providers / Self-hostable, Open Source
- Conifer - From Rhizome, source available at https://github.com/Rhizome-Conifer.
Web Archiving Service Providers / Hosted, Closed Source
- Archive-It - From the Internet Archive.
6. Awesome Generative Deep Art
Generative AI history, timelines, maps, and definitions
- 60+ Generative AI Terms You Must Know By Heart: by Analytics Vidhya
AI Tools for Research / Multi-agents
7. Awesome Selfhosted
Software / Database Management
- AdminerEvo - Database management in a single PHP file. Available for MySQL, MariaDB, PostgreSQL, SQLite, MS SQL, Oracle, Elasticsearch, MongoDB and others (fork of Adminer). (Source Code (⭐522)) Apache-2.0/GPL-2.0 PHP
8. Awesome Azure Openai Llm
RAG Pipeline & Advanced RAG
- 9 Effective Techniques To Boost Retrieval Augmented Generation (RAG) Systems doc: ReRank, Prompt Compression, Hypothetical Document Embedding (HyDE), Query Rewrite and Expansion, Enhance Data Quality, Optimize Index Structure, Add Metadata, Align Query with Documents, Mixed Retrieval (Hybrid Search) [2 Jan 2024]
The Problem with RAG
- Seven Failure Points When Engineering a Retrieval Augmented Generation System: 1. Missing Content, 2. Missed the Top Ranked Documents, 3. Not in Context, 4. Not Extracted, 5. Wrong Format, 6. Incorrect Specificity, 7. Lack of Thorough Testing [11 Jan 2024]
LlamaIndex
- From Simple to Advanced RAG ref [10 Oct 2023]
Vector Database Comparison
- A Comprehensive Survey on Vector Database: Categorizes search algorithms by their approach, such as hash-based, tree-based, graph-based, and quantization-based. [18 Oct 2023]
Microsoft Azure OpenAI relevant LLM Framework / Lucene based search engine with OpenAI Embedding
- TaskWeaver (⭐5.1k): A code-first agent framework which can convert natural language user requests into executable code, with additional support for rich data structures, dynamic plugin selection, and domain-adapted planning process. [Sep 2023]
OpenAI's Roadmap and Products / OpenAI's plans according to Sam Altman
- Sam Altman reveals in an interview with Bill Gates (2 days ago) what's coming up in GPT-4.5 (or GPT-5): Potential integration with other modes of information beyond text, better logic and analysis capabilities, and consistency in performance over the next two years. ref [12 Jan 2024]
OpenAI's Roadmap and Products / OpenAI Products
- ChatGPT Plugin [23 Mar 2023]
- Introducing the GPT Store: Rolling out the GPT Store to ChatGPT Plus, Team and Enterprise users. GPTs [10 Jan 2024]
Trustworthy, Safe and Secure LLM / GPT series release date
- OpenAI Weak-to-strong generalization: In the superalignment problem, humans must supervise models that are much smarter than them. The paper discusses supervising a GPT-4 or 3.5-level model using a GPT-2-level model. It finds that while strong models supervised by weak models can outperform the weak models, they still don’t perform as well as when supervised by ground truth. git (⭐2.5k) [14 Dec 2023]
- A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models: A comprehensive survey of over thirty-two techniques developed to mitigate hallucination in LLMs [2 Jan 2024]
Build an LLMs from scratch: picoGPT and lit-gpt / GPT series release date
- Build a Large Language Model (From Scratch) (⭐25k):🏆Implementing a ChatGPT-like LLM from scratch, step by step
Evaluating Large Language Models / OSS Alternatives for OpenAI Code Interpreter (aka. Advanced Data Analytics)
- Evaluation Papers for ChatGPT (⭐451) [28 Feb 2023]