AI

Deep Learning and Collective Intelligence | Paper: Collective intelligence for deep learning: A survey of recent developments

This is a summary of the paper "Collective intelligence for deep learning: A survey of recent developments," which explores the relationship between deep learning and collective intelligence. Introduction Collective intelligence for deep l…

Understanding GPT-2 | Paper Summary: Language Models are Unsupervised Multitask Learners

This is a summary of the GPT-2 paper "Language Models are Unsupervised Multitask Learners." Introduction Language Models are Unsupervised Multitask Learners Overview Method Creating the WebText Training Dataset BPE: Byte Pair Encoding Mode…

Difference Between torch.tensor() and torch.Tensor() in PyTorch

This is a quick reference note on the differences between torch.tensor() and torch.Tensor(). Introduction Differences Between torch.tensor and torch.Tensor In Short In Detail Note: Creating Empty Tensors Conclusion References Introduction …

Behavior When Combining GPT Stream and Timeout | OpenAI Python Library

I'll verify what happens when calling the OpenAI API from Python with both stream and timeout settings enabled. Introduction Behavior When Combining GPT Stream and Timeout Hypothesis Testing Method Test Results Summary Conclusion Reference…

Implementing GPT Stream Responses with OpenAI API in Python

This article explains how to implement GPT streaming responses using the OpenAI API in Python. Introduction Implementing GPT Stream Responses with OpenAI API Conclusion References Introduction When you call GPT using the OpenAI API, the de…

Setting Timeout in the OpenAI Python Library

This is a guide on how to set timeouts when calling the OpenAI API using the OpenAI Python Library. Introduction Setting Timeout in the OpenAI Python Library How to: Using the request_timeout Parameter A Pitfall: The timeout Parameter Conc…

Understanding Organoid Intelligence | Paper Notes: Organoid intelligence (OI): the new frontier in biocomputing and intelligence-in-a-dish

This is a summary of the review paper "Organoid intelligence (OI): the new frontier in biocomputing and intelligence-in-a-dish," which covers developments in Organoid Intelligence (OI). Introduction Organoid intelligence (OI): the new fron…

Understanding the First GPT | Paper Notes: Improving Language Understanding by Generative Pre-Training

This is a summary of the first GPT paper, "Improving Language Understanding by Generative Pre-Training." Introduction Improving Language Understanding by Generative Pre-Training Overview Method Results Natural Language Inference Tasks Ques…

Reading the Transformer Paper: Attention Is All You Need

This is a summary of the seminal paper "Attention Is All You Need," which introduced the Transformer architecture. Introduction Attention Is All You Need Overview Method Model Architecture Training Method Results Translation Tasks Transfor…

Understanding Sakana.ai's Evolutionary Model Merging | Paper Notes: Evolutionary Optimization of Model Merging Recipes

This is a summary of the paper "Evolutionary Optimization of Model Merging Recipes," which describes Sakana.ai's evolutionary model merging approach. Introduction Evolutionary Optimization of Model Merging Recipes Overview Method Results L…

LLM and Brain Theory: Differences and Similarities with Active Inference

This article explores the differences and similarities between Active Inference from the Free Energy Principle and LLMs (Large Language Models), based on the paper "Predictive Minds: LLMs As Atypical Active Inference Agents." Introduction …

How to Read Research Papers: Learning from Andrew Ng

I'll share techniques for reading research papers from Andrew Ng's lecture video. Introduction How to Read Research Papers According to Andrew Ng Reading Papers Checking Your Understanding Conclusion References Introduction Recently, as I'…

Understanding 1-bit LLMs | Paper Notes: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

This is a summary of the paper "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits." Introduction The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Overview Method Results Conclusion/Thoughts References Intro…

Running Stable Diffusion on M2 Mac | Stable Diffusion web UI

I'll install Stable Diffusion web UI on my M2 Mac and get it up and running. Introduction Running Stable Diffusion web UI on M2 Mac 1. Installing Required Libraries 2. Clone the Stable Diffusion web UI Source Code 3. Place the Stable Diffu…

Auto-Commit with a Custom Slash Command in Claude Code

This post introduces a convenient way to automate git commit using a custom slash command in Claude Code. Introduction Auto-Commit with a Custom Slash Command in Claude Code How to Do It Example Conclusion References Introduction Writing a…

How to Create Custom Slash Commands in Claude Code

This post is a quick memo on how to create custom slash commands in Claude Code. Introduction What Are Custom Slash Commands in Claude Code? How to Create Custom Slash Commands Basic Syntax and Example Commands Prompt Only Arguments Runnin…

How to Delete All Chats in Claude

This is a quick memo on how to bulk delete all your chat history in Claude. Introduction How to Bulk Delete All Chats in Claude Steps Example Conclusion References Introduction Lately, I've been using Claude more often than ChatGPT. I coul…

How to Delete Images from ChatGPT Library

If you delete the chat where the image was generated, the corresponding image will also be deleted from the ChatGPT Library. Introduction How to Delete Images from ChatGPT Library Method Let’s Try It Conclusion Reference Introduction A new…

Combining GPT-4 Vision with Raspberry Pi | Making GPT the Brain of a Robot, Part 2

In this post, I’ll run a simple robot experiment on Raspberry Pi, combining a camera and servo motors with GPT-4 Vision. For Part 1, see: en.bioerrorlog.work Introduction Combining GPT-4 Vision with Raspberry Pi Concept Setup Hardware Soft…

Making LLM the Brain of a Robot - Part 1: Combining a Camera/Servo Motor with GPT

We'll conduct a simple experiment combining a camera, servo motors, and GPT on a Raspberry Pi. Introduction Combining GPT with Raspberry Pi, Camera, and Servo Motors Prototype Concept Setup Hardware Software Results Challenges Object Detec…

Shell Command to Combine Multiple Files for AI Coding

We will create a shell command to quickly combine the contents of multiple files, including directory structure information, for LLM prompts. Introduction Shell Command to Concatenate Multiple Files for AI Coding Goal Command Implementatio…

What the Underscore Suffix Means in PyTorch Function Names

A trailing underscore indicates an in-place operation. Introduction Underscore "_" Suffix in PyTorch Function Names Meaning Example Conclusion References Introduction When working with PyTorch, you'll often see functions with a trailing underscore _ in their n…

Building an MCP Server in Go

This is a memo on how to implement an MCP server using Golang. Introduction Prerequisite: Is there an official Go MCP SDK? Implementing an MCP Server in Go The Target MCP Server Implementing the MCP server in Go Implementing the Tool Imple…

Streamlining MCP Server Development with MCP CLI

This guide summarizes how to use the MCP CLI. Introduction How to Use MCP CLI Installing MCP CLI List of MCP CLI Commands How to Use mcp run How to Use mcp dev How to Use mcp install Conclusion References Introduction MCP CLI is a tool inc…

How to Use MCP Inspector: A Testing Tool for MCP Servers

This is a quick guide on how to use "MCP Inspector," a browser-based test tool for MCP servers. Introduction How to Use MCP Inspector Preparation: Set Up a Minimal MCP Server Launching MCP Inspector Using MCP Inspector Conclusion Reference…

Letting LLM Play Minesweeper via MCP

Use MCP to let an LLM play Minesweeper. Introduction Playing Minesweeper via MCP Overview Starting the Minesweeper Server Building the MCP Server Connecting Claude Desktop with MCP Server Letting Claude Play Minesweeper Conclusion Referenc…

Connecting Claude Desktop to MCP Server: Step-by-Step Guide

This is a quick note on how to connect Claude Desktop to an MCP (Model Context Protocol) server. Introduction How to Connect Claude Desktop to an MCP Server 1. Add MCP Server Information to the Config File 2. Restart Claude Desktop Conclusion Ref…

AI Agent Patterns & Practices

Learn about AI agent patterns and practices from Anthropic’s article "Building effective agents" and its follow-up video. Introduction Background: What Is an AI Agent? AI Agent Patterns Augmented LLM Workflow: Prompt chaining Workflow: Rou…

Connecting GPT with MCP Server: MCP Tutorial

In this post, we'll connect GPT / OpenAI API with MCP (Model Context Protocol), and learn how MCP works. Introduction Recap: What is MCP? Connecting GPT to an MCP Server Implementing the MCP Server Implementing the MCP Client Running the M…

LLM-Based 3D Modeling in Blender: Trying Out MeshGen/LLaMA-Mesh

This post explores LLaMA-Mesh, an LLM that outputs 3D model data, and its Blender add-on, MeshGen. We'll also try using ChatGPT for 3D modeling. Introduction Understanding LLaMA-Mesh 3D Modeling with MeshGen Installing MeshGen in Blend…