Search: [ml] - David A. Windham

Vibe Coding is the Future

code · ai · ml · work

March 20, 2025 at 13:55:31 EDT * · permalink

https://www.youtube.com/watch?v=IACHfKmZMr8

This course focuses on efficient machine learning and systems. This is a crucial area as deep neural networks demand extraordinary levels of computation, hindering its deployment on everyday devices and burdening the cloud infrastructure. This course introduces efficient AI computing techniques that enable powerful deep learning applications on resource-constrained devices. Topics include model compression, pruning, quantization, neural architecture search, distributed training, data/model parallelism, gradient compression, and on-device fine-tuning. It also introduces application-specific acceleration techniques for large language models and diffusion models. Students will get hands-on experience implementing model compression techniques and deploying large language models (Llama2-7B) on a laptop.

ai · ml · learn

November 26, 2024 at 13:47:08 EST * · permalink

https://hanlab.mit.edu/courses/2024-fall-65940

mini cluster for ml

using a thunderbolt 5 bridge and https://github.com/mit-han-lab/TinyChatEngine

ml · ai

November 26, 2024 at 10:44:47 EST * · permalink

https://www.youtube.com/watch?v=GBR6pHZ68Ho

mem0ai/mem0

reader · ai · ml

August 19, 2024 at 07:43:44 EDT * · permalink

https://github.com/mem0ai/mem0

GPT-4 barrier has finally been broken

reader · ml · ai

March 9, 2024 at 08:49:45 EST * · permalink

https://simonwillison.net/2024/Mar/8/gpt-4-barrier/#atom-everything

Build the GPT Tokenizer

Andrej Karpathy - The Tokenizer is a necessary and pervasive component of Large Language Models (LLMs), where it translates between strings and tokens (text chunks).

ai · ml

February 22, 2024 at 06:50:15 EST * · permalink

https://www.youtube.com/watch?v=zduSFxRajkE

Gemini 1.5

reader · ai · ml

February 15, 2024 at 15:06:07 EST * · permalink

https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/

GPT in 500 lines of SQL

A complete GPT2 implementation as a single SQL query in PostgreSQL.

ai · ml · sql

January 8, 2024 at 08:14:41 EST * · permalink

https://explainextended.com/2023/12/31/happy-new-year-15/

Stuff we figured out about AI in 2023

reader · ai · ml

January 2, 2024 at 08:06:06 EST * · permalink

https://simonwillison.net/2023/Dec/31/ai-in-2023/

llamafile 🔥

Mozilla’s innovation group and Justine Tunney just released llamafile, and I think it’s now the single best way to get started running Large Language Models,

reader · ai · c · ml

November 29, 2023 at 18:40:29 EST * · permalink

http://simonwillison.net/2023/Nov/29/llamafile/#atom-everything

Approximate Nearest Neighbors

reader · db · vector · ai · ml

October 30, 2023 at 13:42:27 EDT * · permalink

https://zilliz.com/learn/approximate-nearest-neighbor-oh-yeah-ANNOY

Student Use Cases for AI

Harvard Business Publishing Education

ai · ml · education

September 27, 2023 at 14:24:50 EDT * · permalink

https://hbsp.harvard.edu/inspiring-minds/student-use-cases-for-ai

The Notetaking Cold War

Digging into the philosophical roots of the battle between Essentialists and Pragmatists

notes · thinking · til · ai · ml

August 26, 2023 at 11:41:32 EDT * · permalink

https://every.to/superorganizers/the-notetaking-cold-war-591898

Do we really need a specialized vector database?

reader · ai · ml · db

August 12, 2023 at 09:21:29 EDT * · permalink

https://modelz.ai/blog/pgvector

Shipping AI developer tools

on Github CoPilot

reader · ai · ml · dev

August 8, 2023 at 20:24:50 EDT * · permalink

https://github.blog/2023-08-08-a-guide-to-designing-and-shipping-ai-developer-tools/

Alpaca Eval Leaderboard

llm · ai · ml

July 27, 2023 at 20:48:25 EDT * · permalink

https://tatsu-lab.github.io/alpaca_eval/

Do LLMs know what they are talking about?

Large language models (LLMs) have only just emerged into mainstream thought, and already they’ve shown themselves to be a powerful tool for interacting with data. While some might classify them as merely a really cool new form of UI, others think that this may be the start of artificial general intelligence.

reader · ai · ml

July 6, 2023 at 10:08:22 EDT * · permalink

https://stackoverflow.blog/2023/07/03/do-large-language-models-know-what-they-are-talking-about/

Reddit Is OpenAI's Moat

Much has been written about the Reddit boycott recently, but I have kind of a wild take. what if we thought of Reddit as, functionally, subservient to OpenAI?

reader · openai · ml · ai · social

June 14, 2023 at 10:15:26 EDT * · permalink

https://www.cyberdemon.org/2023/06/14/reddit-moat.html

Obsidian-Copilot

What would a copilot for writing and thinking look like? To try answering this question, I built a prototype: Obsidian-Copilot

reader · obsidian · md · ai · ml · openai

June 14, 2023 at 10:05:01 EDT * · permalink

https://eugeneyan.com/writing/obsidian-copilot/

ChatGPT Plugins Don't Have PMF

In a now-taken-down blog post summarizing an event with Sam Altman, Altman revealed that he doesn’t believe that ChatGPT plugins have product-market fit (outside of the browsing plugin) and won’t be coming to the API soon. Why? A few hypotheses (not mutually exclusive). "Chat is not the right UX for plugins. If you know what you want to do, it’s often easier to just do a few clicks on the website. If you don’t, just a chat interface makes it hard to steer the model toward your goal."

ai · ml · gpt · openai

June 12, 2023 at 09:34:21 EDT * · permalink

https://matt-rickard.com/chatgpt-plugins-dont-have-pmf

Health system-scale language models

Clinical predictive models can help physicians and administrators make decisions by forecasting clinical and operational events.

ai · ml · healthcare

June 8, 2023 at 12:20:40 EDT * · permalink

https://www.nature.com/articles/s41586-023-06160-y

Why Chatbots Are Not the Future of Interfaces

Amelia Wattenberger makes a convincing argument for why chatbots are a terrible interface for LLMs.

chat · bots · ai · ml · ui

May 16, 2023 at 09:52:08 EDT * · permalink

https://wattenberger.com/thoughts/boo-chatbots

Introducing MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs

Introducing MPT-7B, the latest entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B. MPT-7B was trained on the MosaicML platform in 9.5 days with zero human intervention at a cost of ~$200k. Starting today, you can train, finetune, and deploy your own private MPT models, either starting from one of our checkpoints or training from scratch. For inspiration, we are also releasing three finetuned models in addition to the base MPT-7B: MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which uses a context length of 65k tokens!

ml · llm · ai

May 6, 2023 at 18:36:40 EDT * · permalink

https://www.mosaicml.com/blog/mpt-7b

MosaicBERT: Pretraining BERT from Scratch for $20

With the MosaicBERT architecture + training recipe, you can now pretrain a competitive BERT-Base model from scratch on the MosaicML platform for $20. We’ve released the pretraining and finetuning code, as well as the pretrained weights.

ml · llm · ai

May 6, 2023 at 18:35:54 EDT * · permalink

https://www.mosaicml.com/blog/mosaicbert

What is a Vector Database?

Good basic explainer on vector databases.

reader · ai · ml · data

May 6, 2023 at 18:07:07 EDT * · permalink

https://www.pinecone.io/learn/vector-database/

Sam Altman: the Future of AI | Lex Fridman

Sam Altman is the CEO of OpenAI, the company behind GPT-4, ChatGPT, DALL-E, Codex, and many other state-of-the-art AI technologies.

ai · ml · podcast

April 30, 2023 at 11:05:58 EDT * · permalink

https://www.youtube.com/watch?v=L_Guz73e6fw

mayooear/gpt4-pdf-chatbot-langchain

Use the new GPT-4 api to build a chatGPT chatbot for multiple Large PDF files.

reader · gpt · ai · ml

April 21, 2023 at 16:23:15 EDT * · permalink

https://github.com/mayooear/gpt4-pdf-chatbot-langchain

LLM trained on 500k group chat messages

reader · ai · ml

April 12, 2023 at 13:20:36 EDT * · permalink

https://www.izzy.co/blogs/robo-boys.html

The Changelog podcast: LLMs break the internet

This time, we spent the whole episode talking about large language models: ChatGPT, GPT-4, Bing, Bard, Claude, LLaMA and more.

reader · podcast · ai · ml

April 10, 2023 at 14:40:58 EDT * · permalink

http://simonwillison.net/2023/Apr/8/llms-break-the-internet/#atom-everything

The Data Delusion - Jill Lepore

Long before the invention of the general-purpose computer, bureaucrats and researchers had begun gathering and cross-tabulating sets of numbers about populations—heights, weights, ages, sexes, races, political parties, incomes—using punch cards and tabulating machines. Sergey Brin said "Auto insurance companies analyse accident data and set insurance rates of individuals according to age, gender, vehicle type,” he pointed out. “If they were allowed to by law, they would also use race, religion, handicap, and any other attributes they find are related to accident rate.”

reader · ai · ml · ideas

April 3, 2023 at 08:41:26 EDT * · permalink

https://www.newyorker.com/magazine/2023/04/03/the-data-delusion

Talk with an LLaMA AI in your terminal

Talk with an LLaMA AI in your terminal / Port of OpenAI's Whisper model in C/C++. Contribute to ggerganov/whisper.cpp development by creating an account on GitHub.

ai · ml

March 29, 2023 at 11:10:21 EDT * · permalink

https://github.com/ggerganov/whisper.cpp/tree/master/examples/talk-llama

I gave GPT-4 a budget of $100 and told it to make as much money as possible.

Twitter thread from Jackson Fall on using gpt to create a website for making money.

ai · gpt · ml · spam

March 16, 2023 at 09:43:28 EDT * · permalink

https://twitter.com/jacksonfall/status/1636107218859745286

GPT-4 Technical Report

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers.

ml · ai · gpt

March 16, 2023 at 09:24:32 EDT * · permalink

https://cdn.openai.com/papers/gpt-4.pdf

fka/awesome-chatgpt-prompts · Datasets at Hugging Face

Dataset Repository of Awesome ChatGPT Prompts from https://github.com/f/awesome-chatgpt-prompts

ai · ml · gpt

March 16, 2023 at 09:23:05 EDT * · permalink

https://huggingface.co/datasets/fka/awesome-chatgpt-prompts