This course focuses on efficient machine learning and systems. This is a crucial area, as deep neural networks demand extraordinary levels of computation, hindering their deployment on everyday devices and burdening cloud infrastructure. This course introduces efficient AI computing techniques that enable powerful deep learning applications on resource-constrained devices. Topics include model compression, pruning, quantization, neural architecture search, distributed training, data/model parallelism, gradient compression, and on-device fine-tuning. It also introduces application-specific acceleration techniques for large language models and diffusion models. Students will get hands-on experience implementing model compression techniques and deploying large language models (Llama2-7B) on a laptop.
Using a Thunderbolt 5 bridge and https://github.com/mit-han-lab/TinyChatEngine.
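To make one of those course topics concrete, here is a minimal sketch of post-training 8-bit weight quantization; the symmetric scaling scheme and function names are illustrative assumptions, not the course's reference implementation.

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Symmetric post-training quantization of a weight tensor to int8."""
    scale = np.abs(w).max() / 127.0              # map the largest magnitude to +/-127
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale          # recover approximate float weights

w = np.random.randn(4, 4).astype(np.float32)    # stand-in for a layer's weights
q, scale = quantize_int8(w)
print("max abs error:", np.abs(w - dequantize(q, scale)).max())
```

Storing int8 instead of float32 cuts weight memory by 4x, which is the basic trade that lets a 7B-parameter model fit on a laptop.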
AI speaks with itself and reveals the mad dreams of an electric mind.
An interview with David Lee, the CCO of Squarespace, poses an interesting question: Will AI make human creativity a luxury?
Learn how to build an agent that can reason over your documents and answer complex questions, taught by the co-founder and CEO of LlamaIndex.
Roie Schwaber-Cohen, Staff Developer Advocate at Pinecone, joins Ben and Ryan to break down what retrieval-augmented generation (RAG) is and why the concept is central to the AI conversation.
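As a rough illustration of the retrieval step in RAG, the sketch below uses TF-IDF cosine similarity as a stand-in for a learned embedding model; the corpus and query are toy examples, not anything from the episode.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

docs = [
    "RAG retrieves relevant documents and adds them to the prompt.",
    "Vector databases store embeddings for fast similarity search.",
    "Tokenizers translate between strings and token ids.",
]
query = "how does retrieval-augmented generation work?"

vec = TfidfVectorizer().fit(docs + [query])
scores = cosine_similarity(vec.transform([query]), vec.transform(docs))[0]
best = scores.argmax()                     # index of the most similar document

# The retrieved context is prepended to the question before the
# combined prompt is sent to the language model.
prompt = f"Context: {docs[best]}\n\nQuestion: {query}"
print(prompt)
```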
Volvo AD
Andrej Karpathy - The Tokenizer is a necessary and pervasive component of Large Language Models (LLMs), where it translates between strings and tokens (text chunks).
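To see that string-to-token translation in action, here is a quick round-trip with the tiktoken library, using the GPT-2 vocabulary (one possible tokenizer, not the one Karpathy builds from scratch):

```python
import tiktoken

enc = tiktoken.get_encoding("gpt2")       # the GPT-2 BPE vocabulary

tokens = enc.encode("Tokenizers translate between strings and tokens.")
print(tokens)                             # a list of integer token ids
print(enc.decode(tokens))                 # decodes back to the original string
print([enc.decode([t]) for t in tokens])  # the text chunk behind each id
```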
Lucian Grainge, the chairman of UMG, has helped record labels rake in billions of dollars from streaming. Can he do the same with generative artificial intelligence? John Seabrook reports.
A complete GPT2 implementation as a single SQL query in PostgreSQL.
AI coding assistants are here to stay—but just how big a difference they make is still unclear.
Mozilla’s innovation group and Justine Tunney just released llamafile, and I think it’s now the single best way to get started running Large Language Models.
Draw a picture (fast) with tldraw and https://www.fal.ai
The New Yorker - James Somers, a professional coder, writes about the astonishing scripting skills of A.I. chatbots like GPT-4 and considers the future of a once exalted craft. ~ "What I learned was that programming is not really about knowledge or skill but simply about patience, or maybe obsession. Programmers are people who can endure an endless parade of tedious obstacles."
A world of AI-assisted writing and reviewing might transform the nature of the scientific paper.
Harvard Business Publishing Education
A browser-based search engine for Wikipedia, where you can search more abstractly.
Digging into the philosophical roots of the battle between Essentialists and Pragmatists
On GitHub Copilot.
The model behind ChatGPT passed the first three sommelier theory exams. What does that mean for the wine business and our professional certifications?
An LLM that'll fit on a small device with low memory.
An influential M.I.T. professor and an outside-the-box scientific theorist, he gained fame with unorthodox views as a pioneer in digital physics.
Large language models (LLMs) have only just emerged into mainstream thought, and already they’ve shown themselves to be a powerful tool for interacting with data. While some might classify them as merely a really cool new form of UI, others think that this may be the start of artificial general intelligence.
Much has been written about the Reddit boycott recently, but I have kind of a wild take: what if we thought of Reddit as, functionally, subservient to OpenAI?
What would a copilot for writing and thinking look like? To try answering this question, I built a prototype: Obsidian-Copilot
In a now-taken-down blog post summarizing an event with Sam Altman, Altman revealed that he doesn’t believe that ChatGPT plugins have product-market fit (outside of the browsing plugin) and won’t be coming to the API soon. Why? A few hypotheses (not mutually exclusive). "Chat is not the right UX for plugins. If you know what you want to do, it’s often easier to just do a few clicks on the website. If you don’t, just a chat interface makes it hard to steer the model toward your goal."
Clinical predictive models can help physicians and administrators make decisions by forecasting clinical and operational events.
In March and April 2023, the Python Software Foundation (PSF) received three (3) subpoenas for user data from the Python Package Index (PyPI).
SwingVision is the #1 tennis app for real-time automated score keeping, stats & line calling. Measure your serve speed, match stats & more!
Amelia Wattenberger makes a convincing argument for why chatbots are a terrible interface for LLMs.
Introducing MPT-7B, the latest entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B. MPT-7B was trained on the MosaicML platform in 9.5 days with zero human intervention at a cost of ~$200k. Starting today, you can train, finetune, and deploy your own private MPT models, either starting from one of our checkpoints or training from scratch. For inspiration, we are also releasing three finetuned models in addition to the base MPT-7B: MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which uses a context length of 65k tokens!
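A minimal sketch of loading one of these checkpoints with Hugging Face transformers, following the model card's pattern: MPT's custom architecture requires trust_remote_code=True, and it reuses the EleutherAI/gpt-neox-20b tokenizer. The prompt and generation settings below are arbitrary.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "mosaicml/mpt-7b-instruct"   # or mpt-7b, mpt-7b-chat, mpt-7b-storywriter
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
model = AutoModelForCausalLM.from_pretrained(name, trust_remote_code=True)

inputs = tokenizer("Here is a recipe for", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(out[0]))
```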
With the MosaicBERT architecture + training recipe, you can now pretrain a competitive BERT-Base model from scratch on the MosaicML platform for $20. We’ve released the pretraining and finetuning code, as well as the pretrained weights.
Good basic explainer on vector databases.
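The core operation a vector database optimizes, in a few lines of NumPy: store items as unit-normalized embedding vectors, then answer a query by cosine-similarity nearest neighbors. Production systems layer approximate indexes (e.g. HNSW) on top so this scales past brute force.

```python
import numpy as np

rng = np.random.default_rng(0)
index = rng.normal(size=(10_000, 128))                # stored embedding vectors
index /= np.linalg.norm(index, axis=1, keepdims=True)

query = rng.normal(size=128)                          # embedding of the query
query /= np.linalg.norm(query)

scores = index @ query                                # cosine similarity (unit vectors)
top_k = np.argsort(scores)[-5:][::-1]                 # ids of the 5 nearest neighbors
print(top_k, scores[top_k])
```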
Sam Altman is the CEO of OpenAI, the company behind GPT-4, ChatGPT, DALL-E, Codex, and many other state-of-the-art AI technologies.
Use the new GPT-4 API to build a ChatGPT-style chatbot over multiple large PDF files.
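A hedged sketch of that idea, using pypdf for extraction and the 2023-era openai client; the file names are placeholders, and a real app would chunk and embed the PDFs rather than truncating them into the prompt.

```python
import openai
from pypdf import PdfReader

openai.api_key = "sk-..."  # your API key

# Extract text from each PDF (placeholder file names).
text = "\n".join(
    page.extract_text() or ""
    for name in ["report1.pdf", "report2.pdf"]
    for page in PdfReader(name).pages
)

resp = openai.ChatCompletion.create(
    model="gpt-4",
    messages=[
        {"role": "system", "content": f"Answer using these documents:\n{text[:8000]}"},
        {"role": "user", "content": "Summarize the key findings."},
    ],
)
print(resp.choices[0].message.content)
```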
This time, we spent the whole episode talking about large language models: ChatGPT, GPT-4, Bing, Bard, Claude, LLaMA and more.
Long before the invention of the general-purpose computer, bureaucrats and researchers had begun gathering and cross-tabulating sets of numbers about populations—heights, weights, ages, sexes, races, political parties, incomes—using punch cards and tabulating machines. “Auto insurance companies analyse accident data and set insurance rates of individuals according to age, gender, vehicle type,” Sergey Brin pointed out. “If they were allowed to by law, they would also use race, religion, handicap, and any other attributes they find are related to accident rate.”
Talk with a LLaMA AI in your terminal: a port of OpenAI's Whisper model in C/C++, from ggerganov/whisper.cpp.
The New Yorker | OpenAI’s chatbot offers paraphrases, whereas Google offers quotes. Which do we prefer?
Twitter thread from Jackson Fall on using GPT to create a website for making money.
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers.
Dataset repository of Awesome ChatGPT Prompts, from https://github.com/f/awesome-chatgpt-prompts
A creature is formed of clay. A puppet becomes a boy. A monster rises in a lab. A computer takes over a spaceship. And all manner of robots serve or control us. For generations we’ve told ourselves stories, using themes of magic and science, about inanimate things that we bring to life or imbue with power beyond human capacity.
Proving you're a human on a web flooded with generative AI content
~ A computer scientist at the Stanford Artificial Intelligence Laboratory... wrote "AI is a fantasy."
The ethics of AI are constantly debated. But does anyone ask the AI?
Project Debater was designed by IBM Research. It will deliver a speech based on over 1,100 arguments collected from Union members and others over the past week. It will not be taking points of information.