All Notes: Nonlinear Function

All Notes

psychedelic

[ 5-MeO-DMT ] [ mescaline ] [ psilocybin ]

Modified: October 04, 2021.

purchases I recommend

Toilet: A bidet. Cold-water, warm-water (if a hose from your toilet can reach your sink's plumbing), or internally heated. It saves toilet…

Modified: September 25, 2021.

purpose

Modified: June 27, 2021.

pushforward natural gradient

It's tempting to use [ natural gradient ] ascent to optimize a variational distribution. We could also consider using it to optimize the…

Modified: October 25, 2020.

put-call parity

A portfolio containing a long (European) call and short (European) put [ option ] with the same strike price and expiry date is equivalent…

Modified: October 26, 2021.
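
A small numerical check for the put-call parity note above. At expiry, a long call plus a short put at the same strike pays the spot price minus the strike, which is the payoff of a long forward struck at that price. The function names and sample prices are illustrative assumptions, not taken from the note.

```python
def call_payoff(spot: float, strike: float) -> float:
    """Payoff at expiry of a long European call."""
    return max(spot - strike, 0.0)


def put_payoff(spot: float, strike: float) -> float:
    """Payoff at expiry of a long European put."""
    return max(strike - spot, 0.0)


def synthetic_forward_payoff(spot: float, strike: float) -> float:
    """Long call plus short put at the same strike and expiry."""
    return call_payoff(spot, strike) - put_payoff(spot, strike)


if __name__ == "__main__":
    strike = 100.0  # hypothetical strike price
    for spot in (50.0, 100.0, 150.0):
        # In every case the combined payoff equals spot - strike.
        print(spot, synthetic_forward_payoff(spot, strike), spot - strike)
```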

pyrrole ring

A five-membered aromatic ring of four carbons and one nitrogen: C4H4NH.

Modified: May 14, 2021.

python project setup

General procedure for setting up a new Python project. Create a new git repo and clone into a directory my_new_project. Add files…

Modified: April 09, 2022.

qualia

Modified: .

quiet desperation

Modified: .

quotes

"Any one who considers arithmetical methods of producing random digits is, of course, in a state of sin." - John von Neumann Young man, in…

Modified: February 11, 2022.

random variable

Formally, a random variable is a (measurable) function defined on outcomes from a [ probability space ] . That is, in any possible…

Modified: August 27, 2022.

randomized controlled trial

a powerful tool for establishing [ causality ]

Modified: February 07, 2022.

rate equation

The rate equation or master equation for a continuous-time Markov [ stochastic process ] describes how the probability density of the…

Modified: August 28, 2022.

rationality is moral

From a [ utilitarian ] perspective, all of morality follows from improving global utility, and it follows that it'd be better to do this…

Modified: June 07, 2021.

reading inbox

In no particular order. Items may move to [ previously read ] if I read them or former reading inbox if I decide I'm not currently…

Modified: August 28, 2023.

reading is processing

One model you could have of reading a book is that the book contains information, and once you've read it, you now possess that information…

Modified: February 23, 2020.

reality tunnel

Modified: February 18, 2020.

reasons to write

Why do I want to write more? Because: writing forces thoughts to crystallize. It forces me to draw conclusions about what I believe and who…

Modified: May 16, 2022.

recipe for ruin

[ Nielsen's notes on ASI xrisk ] introduced the thought experiment: If you ask an all-knowing oracle a question like "Can you give me a…

Modified: September 29, 2023.

recipes

See also [ family recipes ]. Roast chicken and vegetables: preheat oven to ~400. cover a spatchcocked chicken with salted garlic butter at…

Modified: March 03, 2022.

recruiting

The best way to recruit people is to convince them that they will learn and grow by working with your team. Pitches that have 'worked' for…

Modified: July 19, 2021.

regularization

Modified: .

reinforced self-training

thoughts on reinforced self-training paper: https://arxiv.org/abs/2308.08998 the basic idea is very simple. we sample additional…

Modified: October 24, 2024.

reinforcement learning

Note : see [ reinforcement learning notation ] for a guide to the notation I'm attempting to use through my RL notes. Three paradigmatic…

Modified: April 23, 2022.

reinforcement learning advice

https://andyljones.com/posts/rl-debugging.html https://www.reddit.com/r/reinforcementlearning/comments/9sh77q/what_are_your_best_tips_for…

Modified: March 28, 2022.

reinforcement learning from human feedback

see: [ steering language models ], [ direct preference optimization ] We are given a bunch of pairwise preference evaluations, of the form…

Modified: .

reinforcement learning notation

There tends to be a lot going on in RL algorithms, with a whole mess of different quantities defined across timesteps. It's useful to try to…

Modified: April 23, 2022.

relationship

[ relationship advice ]

Modified: February 10, 2022.

relationship advice

see also (maybe combine with?) [ relationship ] Accept [ bids ] as much as possible. Praise your partner in public (and in private). Stay in…

Modified: July 13, 2020.

religion

Modified: December 01, 2022.

relu inequality

Suppose we want a [ transformer ] to evaluate the inequality returning if and otherwise. For integer , this can be done with a…

Modified: February 13, 2023.

relu selection

The selection operation y = where(c, a, b) returns a where c is true and b otherwise. How can a [ transformer ] layer implement this operation? One approach is to use…

Modified: February 12, 2023.
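
One possible construction for the relu selection note above, using only affine maps and ReLUs. It is not necessarily the approach the note itself goes on to describe. It assumes c is exactly 0 or 1 and that |a| and |b| are at most a known bound; the names `relu_select` and `bound` are hypothetical.

```python
def relu(x: float) -> float:
    return max(x, 0.0)


def relu_select(c: float, a: float, b: float, bound: float) -> float:
    """Compute where(c, a, b) for c in {0, 1} and |a|, |b| <= bound.

    Each input is split into positive and negative parts. Subtracting
    `bound` inside the ReLUs zeroes out the branch that was not selected.
    """
    off_a = bound * (1.0 - c)  # 0 when c == 1, so a passes through
    off_b = bound * c          # 0 when c == 0, so b passes through
    return (relu(a - off_a) - relu(-a - off_a)
            + relu(b - off_b) - relu(-b - off_b))


if __name__ == "__main__":
    print(relu_select(1.0, 3.0, -2.0, bound=10.0))  # selects a
    print(relu_select(0.0, 3.0, -2.0, bound=10.0))  # selects b
```

The four ReLU units correspond to one hidden layer of width four followed by a linear readout, which is the shape of a transformer MLP block.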

remember arguments

When I was younger---in college or in grad school---I was sometimes conflicted about whether I should prioritize trying to get to correct…

Modified: February 11, 2022.

reparameterization trick

Modified: .

replica trick

If a model with data has normalizing constant , then the replica trick says that This allows us to analyze the average log-normalizer…

Modified: October 22, 2022.

representation

In modern ML, representation learning is the art of trying to find useful abstractions, embodied as encoding networks. We can learn…

Modified: February 11, 2022.

research community

To be a successful researcher it's incredibly important to find and join your [ research community ]. Go to conferences (especially to small…

Modified: February 25, 2022.

research idea

This note lists some ideas and directions for research I'm interested in or excited about. Some are more fleshed out than others, some more…

Modified: February 21, 2023.

research identity

Modified: .

research worth doing

(see also: [ impact ]) I've been feeling depressed partly because the actual PhD research I did was (in my view) pointless, and more broadly…

Modified: February 11, 2022.

researchers don't always know best

People who do research have a very ground-level, zoomed-in view of their field. They know where the current obstacles are, how incredibly…

Modified: January 16, 2021.

reservoir sampling

Reservoir samplers solve the following task: sample items without replacement from a stream of unknown length . Because the length is…

Modified: May 10, 2022.
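
A minimal sketch for the reservoir sampling note above, using the classic "Algorithm R". The note may go on to describe a different variant. The function name and the optional `rng` argument are assumptions made for illustration.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int,
                     rng: Optional[random.Random] = None) -> List[T]:
    """Sample k items without replacement from a stream of unknown length."""
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Keep the new item with probability k / (i + 1),
            # evicting a uniformly chosen current member.
            j = rng.randint(0, i)  # inclusive on both ends
            if j < k:
                reservoir[j] = item
    return reservoir


if __name__ == "__main__":
    print(reservoir_sample(iter(range(1000)), k=5, rng=random.Random(0)))
```

Only the k-item reservoir is kept in memory and the stream is read once, so the length never needs to be known in advance.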

retreats

Teachers or centers I'd be interested to do a retreat with/at: Tucker Peck; Michael Taft; Tina Rasmussen (Cloud Mountain 13-day retreats…

Modified: August 27, 2021.

reversal curse

References: The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" https://arxiv.org/abs/2309.12288 Studying Large Language…

Modified: January 10, 2024.

reverse diffusion

References: Ludwig Winkler's post on Reverse time stochastic differential equations . Suppose we have a [ stochastic differential equation…

Modified: August 27, 2022.

reward

stray thoughts about reward functions (probably related to the [ agent ] abstraction and the [ intentional stance ]) one can make a…

Modified: April 06, 2023.

reward funnel

When thinking about the [ reward ] function for a real-world AI system, there is always some causal process that determines reward. For…

Modified: April 12, 2023.

reward is enough

Silver, Singh, Precup, and Sutton argue that Reward is enough : maximizing a reward signal implies, on its own, a very broad range of…

Modified: March 02, 2022.

reward shaping

Suppose we have a [ Markov decision process ] in which we get reward only at the very end of a long trajectory. Until that point, we have no…

Modified: March 03, 2022.

reward uncertainty

See also: [ cooperative inverse reinforcement learning ], [ love is value alignment ]

Modified: June 12, 2021.

right effort

four [ right effort ]s: Restraint : avoid unwholesome situations that might give rise to or trigger unwholesome states and patterns. For…

Modified: October 03, 2024.

ring attention

References: Liu, Zaharia, Abbeel. Ring Attention with Blockwise Transformers for Near-Infinite Context (2023). https://arxiv.org/abs/231…

Modified: February 19, 2024.

rl diagnostics

Things that might be useful to log in a [ reinforcement learning ] algorithm: Return of each trajectory. (summarize as mean/std/min/max…

Modified: April 11, 2022.

rl goals

Implement MuZero or something similar. What are the 'state of the art' RL algorithms? What is known and not known about [ value alignment ]?

Modified: February 21, 2022.

rl with proxy objectives

Suppose we want to maximize reward, but we only get a couple bits of reward data every few hundreds/thousands of actions, whereas we get…

Modified: March 03, 2022.

rocket equation

Deriving here just for my own edification. At each timestep a rocket ejects mass at velocity relative to its current reference frame. At…

Modified: September 06, 2022.
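
A numerical sanity check for the rocket equation note above. It simulates many small mass ejections using momentum conservation in the rocket's instantaneous frame and compares the accumulated velocity with the closed form, exhaust velocity times ln(initial mass / final mass). The symbol names and the sample numbers are assumptions, since the note's own notation is not visible in the excerpt.

```python
import math


def tsiolkovsky_delta_v(exhaust_velocity: float, initial_mass: float,
                        final_mass: float) -> float:
    """Closed-form velocity change: v_e * ln(m0 / m1)."""
    return exhaust_velocity * math.log(initial_mass / final_mass)


def simulated_delta_v(exhaust_velocity: float, initial_mass: float,
                      final_mass: float, steps: int = 100_000) -> float:
    """Eject the propellant in `steps` equal chunks and sum the velocity gains."""
    dm = (initial_mass - final_mass) / steps
    mass = initial_mass
    velocity = 0.0
    for _ in range(steps):
        # In the rocket's current rest frame: (mass - dm) * dv = dm * v_e.
        velocity += exhaust_velocity * dm / (mass - dm)
        mass -= dm
    return velocity


if __name__ == "__main__":
    # Hypothetical values: 3000 m/s exhaust, 10 t wet mass, 2 t dry mass.
    print(tsiolkovsky_delta_v(3000.0, 10.0, 2.0))
    print(simulated_delta_v(3000.0, 10.0, 2.0))
```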

romance is twenty times harder for gay people

About 5% of people are gay, so in any given community it's about twenty times harder for a gay person to find a partner than for a straight…

Modified: May 22, 2021.

rubber-duck debugging

Modified: February 14, 2021.

sacred and profane

SuccessfulFriend highlighted this distinction which I should really read more about. At a high level it's about the distinction between…

Modified: July 13, 2020.

safe objective

Language is a really natural way to tell AI systems what we want them to do. Some current examples: [ GPT ]-3 and successors (InstructGPT…

Modified: April 07, 2022.

salt

In chemistry, a salt is a neutral-ish (not too acidic nor basic) compound held together by an [ ionic bond ]. Salts can be formed by [ acid…

Modified: January 22, 2022.

samskara

Modified: .

scale-free alignment

Modified: .

scheduled sampling

Scheduled sampling is a training procedure for sequence models that attempts to mitigate [ exposure bias ] - the problem in which generation…

Modified: October 13, 2022.

score function

The score function is the gradient of a log-density with respect to its parameters: It is the direction that we would move the parameters…

Modified: July 21, 2022.
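
A worked example for the score function note above, using a univariate Gaussian with parameters (mu, sigma) as an assumed illustration. The analytic score, the gradient of the log-density with respect to the parameters, is compared against central finite differences.

```python
import math
from typing import Tuple


def gaussian_log_density(x: float, mu: float, sigma: float) -> float:
    return (-0.5 * math.log(2.0 * math.pi) - math.log(sigma)
            - (x - mu) ** 2 / (2.0 * sigma ** 2))


def gaussian_score(x: float, mu: float, sigma: float) -> Tuple[float, float]:
    """Analytic gradient of the log-density with respect to (mu, sigma)."""
    d_mu = (x - mu) / sigma ** 2
    d_sigma = -1.0 / sigma + (x - mu) ** 2 / sigma ** 3
    return d_mu, d_sigma


def finite_difference_score(x: float, mu: float, sigma: float,
                            eps: float = 1e-6) -> Tuple[float, float]:
    """Central-difference approximation to the same gradient."""
    d_mu = (gaussian_log_density(x, mu + eps, sigma)
            - gaussian_log_density(x, mu - eps, sigma)) / (2.0 * eps)
    d_sigma = (gaussian_log_density(x, mu, sigma + eps)
               - gaussian_log_density(x, mu, sigma - eps)) / (2.0 * eps)
    return d_mu, d_sigma


if __name__ == "__main__":
    print(gaussian_score(1.3, 0.5, 2.0))
    print(finite_difference_score(1.3, 0.5, 2.0))
```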

score matching

Aapo Hyvärinen: Estimation of Non-Normalized Statistical Models by Score Matching (2005) https://jmlr.org/papers/volume6/hyvarinen05a…

Modified: .

self-aware

Modified: February 22, 2022.

self-confidence

[ unearned confidence ] [ agency and confidence ]

Modified: February 07, 2022.

self-love

https://forum.effectivealtruism.org/posts/QhPyQTXuGt58Nzxnu/you-are-probably-underestimating-how-good-self-love-can-be https://twitter.com…

Modified: October 24, 2024.

self-other boundary

Modified: .

selfishness is moral

Sometimes it's necessary and right to prioritize my own interests, even if [ global utility ] is ultimately the only metric. Developing…

Modified: March 07, 2022.

selflessness is rational

I need to genuinely care about other people and want the best for them, both in general, and for specific people in my life. Why? Obviously…

Modified: April 01, 2022.

sense gate

Traditional Buddhism describes six "sense bases" or gates: the eye, ear, nose, tongue, body, and mind. Western science usually omits the…

Modified: .

sense of self

When we talk about "the self", or having a "sense of self", what do we mean? There is an interpretation in terms of [ consciousness ] - that…

Modified: February 03, 2025.

sense of the possible

Take the statement 'human-level AI is possible'. As a kid, I saw this as obviously true. We can simulate physics, and brains are physical…

Modified: January 23, 2022.

serotonin

There are 14 kinds of serotonin receptors; most (but not all) are [ G protein ]-coupled. The central nervous system has almost all of them…

Modified: May 14, 2021.

shadow

Shadow work means, roughly speaking, a practice of noticing, loving, and integrating the parts of yourself that you've repressed (your…

Modified: November 07, 2023.

shard theory

Shard theory's basic ontology of RL holds that shards are contextually activated, behavior-steering computations in neural networks…

Modified: .

side channel

Modified: .

sigma-algebra

Modified: .

simulator AI

References: https://generative.ink/posts/simulators/ It seems pretty clear that the intelligence emerging from [ language model ]s is not…

Modified: February 16, 2023.

single-index model

The performance of an investment can be modeled as where the 'market return' is that of some sufficiently broad index such as the S&P 50…

Modified: November 30, 2022.

skinship

In Korea (and maybe also Japan?) it's common for young guys to bond through physical touch and affection: hugging, holding hands, sitting in…

Modified: February 06, 2023.

sleep

It's weird that we lie down every day to cease our consciousness, and sometimes to hallucinate. There are physiological benefits to sleep…

Modified: July 19, 2022.

small steps

It's not a terrible summation of [ depression ] that it starts from seeing no way to achieve your goals. Sometimes that's because it's…

Modified: May 16, 2022.

soft actor-critic

Modified: .

software engineering

Modified: May 08, 2020.

software lessons from TFP

I've been really unhappy about how TFP is developed. It's felt pedantic. I waste a lot of effort thinking about things I don't want to think…

Modified: April 10, 2021.

southeast asia travel tips

This is the advice I wish I'd had. It's catered to my preferences; caveat emptor. Packing: (for men) bring one pair of long pants, for…

Modified: May 13, 2022.

sparse coding

paper 1: http://redwood.berkeley.edu/bruno/papers/VR.pdf basic idea: find a basis such that any given image (or whatever signal) can be…

Modified: .

sparse distributed memory

References: https://redwood.berkeley.edu/wp-content/uploads/2020/08/KanervaP_SDMrelated_models1993.pdf A sparse distributed memory consists…

Modified: March 29, 2024.

sparse mixture of experts

References: Jacobs, Jordan, Nowlan, Hinton. Adaptive Mixtures of Local Experts (1991) Shazeer et al. Outrageously Large Neural Networks…

Modified: February 13, 2023.

speech becomes less free

I used to be able to say 'superintelligent AI is possible'. Now in industry the notion of 'possible' is 'something I can myself do': by…

Modified: February 23, 2020.

spiritual joy

What would true wireheading feel like? People have this impression that it'd be thin, exhausting, artificial, ultimately isolating and not…

Modified: March 24, 2023.

spiritual path

Modified: .

stablecoin

A stablecoin is a dollar-denominated liability registered on a [ blockchain ]. It can be backed by USD reserves, as Tether allegedly is…

Modified: March 13, 2022.

starting principle

Julian Shapiro recommends keeping a set of six 'Starting principles' that you use to make decisions. That's about all that you can…

Modified: February 07, 2022.

state values, then action values

A common pattern in [ reinforcement learning ] pedagogy is to develop some idea first in the context of estimating state values , and then…

Modified: March 29, 2022.

stationary

A [ stochastic process ] is (strictly) stationary if all of its joint distributions are invariant under time displacement. It is wide…

Modified: August 28, 2022.

staying up late

I've always been an evening person more than a [ morning person ]. I often stay up until 1 or 2am, and in the absence of hard constraints it…

Modified: March 14, 2023.

steering language models

Getting language models to align their output with human preferences would be highly useful for [ computational life coach ]ing. What's the…

Modified: July 18, 2021.

stochastic differential equation

SDEs are typically written in terms of the differential of a Wiener process (Brownian motion), e.g., Although Wiener processes are nowhere…

Modified: August 29, 2022.

stochastic gradient

Modified: .

stochastic process

A stochastic process is a collection of [ random variable ]s defined on a common [ probability space ] . Equivalently, it is a joint…

Modified: August 27, 2022.

stoicism

Pasting a quote from Adam Smith by way of HN (source http://www.econlib.org/library/Smith/smMS7.html , I should read the whole thing…) that…

Modified: February 10, 2022.

stopping time

A stopping time for a stochastic process is a time-valued That is, integer-valued for discrete-time processes and real-valued for…

Modified: August 27, 2022.

strange loops

Modified: .

strong opinion weakly held

Modified: March 14, 2022.

structural equation model

Modified: August 02, 2021.

structural motive

A lot of confused discussion around large organizations comes from conflating individual motivations with larger-scale 'structural…

Modified: December 14, 2022.

structured prediction

In kindergarten stats, you learn how to build a model that takes in data (a feature vector, image, sound file, etc) and predicts a single…

Modified: March 03, 2022.

stupid ideas are good ideas

Revolutionary ideas must live in the blind spots of the current intellectual conversation; otherwise people would already be using them…

Modified: January 24, 2022.

style guide

Note naming: The general goal is to minimize the use of aliasing in links. In cases where these guidelines suggest an unnatural or uncommon…

Modified: July 22, 2022.

substantive questions I've had

These are things I've wondered about that were never answered properly in the classes in which I learned them…

Modified: February 17, 2022.

substituted tryptamines

Substituted tryptamines - PsychonautWiki. Tryptamine consists of an [ indole ] moiety plus a two-carbon (ethyl) chain with an amine group. We…

Modified: May 14, 2021.

suffering

Modified: February 25, 2022.

sufficient statistic

Modified: .

sugar

A sugar is any molecule with the empirical formula C(N)H(2N)O(N). These are like alkanes, which are C(N)H(2N + 2), except that each carbon…

Modified: February 10, 2022.

superposition

A -dimensional vector can represent distinct orthogonal features, but due to the weirdness of [ high-dimension ]al geometry, it can…

Modified: September 14, 2022.

surprises in having a job

What have I learned in 2.5 years at Google? What did I not realize? The model of research. How low the expectations are. How fake it felt to…

Modified: July 10, 2020.

sybil attack

Modified: October 03, 2021.

symmetry theory of valence

Refs: https://opentheory.net/Qualia_Formalism_and_a_Symmetry_Theory_of_Valence.pdf

Modified: August 06, 2023.

syncing supernote with surface pro x

My Supernote A5X syncs through Dropbox, but unfortunately Dropbox doesn't support Windows ARM64 machines like the Surface Pro X. Here's my…

Modified: August 27, 2022.

taṇhā

Buddhist (Pali) term referring to craving, longing, desire for the world to be other than as it is. This includes craving good things and…

Modified: September 03, 2022.

talk about people

There's a famous quote attributed to Eleanor Roosevelt: "great minds discuss ideas, average ones events, mediocre ones discuss people". This…

Modified: February 10, 2022.

tantra

A set of methods for maintaining an " attitude of spacious passion ". The particular methods are contingent; if you could maintain the…

Modified: March 23, 2022.

target network

A general issue with [ temporal difference ] learning methods, which 'update a guess towards a guess', is that they can end up 'chasing…

Modified: April 23, 2022.

teacher forcing

Something that confused me for a while is that people in certain communities talk about 'teacher forcing' as though it's a trick or a…

Modified: October 13, 2022.

teaching

Dave's principles of effective teaching. Motivation is by far the most important thing. A student who wants to learn will learn even with a…

Modified: April 11, 2020.

teaching at the critical point

As a researcher, I wonder if there's a 'critical point' of growing an idea when it's important to be [ teaching ] it, whether formally or…

Modified: April 05, 2020.

teaching lessons learned

working with Sinclair, Klein, Abbeel, they’ve all got great experience and advice especially for large classes. You don’t have to give the…

Modified: February 07, 2022.

teaching machine learning

Rob wants to firm up his foundations. He wants to understand relevant stats, probabilistic models, inference, and maybe work our way up to…

Modified: January 25, 2022.

television is useless

Epistemic status: either this is true or TV is maybe one of the greatest contributions to human utility ever. Unclear. The average American…

Modified: January 24, 2022.

temporal difference

From David Silver's slides : TD-learning 'updates a guess towards a guess'. Sutton and Barto define the temporal difference error as the…

Modified: April 04, 2022.
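
A minimal tabular sketch for the temporal difference note above. It uses the standard one-step TD error, reward plus discounted next-state value minus current value, and the TD(0) update that moves the current guess toward that bootstrapped guess. The state names, step size, and discount are illustrative assumptions.

```python
from typing import Dict, Hashable


def td_error(reward: float, value_s: float, value_next: float,
             gamma: float) -> float:
    """One-step TD error: r + gamma * V(s') - V(s)."""
    return reward + gamma * value_next - value_s


def td0_update(values: Dict[Hashable, float], s: Hashable, reward: float,
               s_next: Hashable, alpha: float, gamma: float,
               terminal: bool = False) -> float:
    """Move V(s) a step of size alpha toward the bootstrapped target, in place."""
    bootstrap = 0.0 if terminal else values[s_next]
    delta = td_error(reward, values[s], bootstrap, gamma)
    values[s] += alpha * delta
    return delta


if __name__ == "__main__":
    values = {"a": 0.0, "b": 1.0}  # hypothetical two-state value table
    delta = td0_update(values, "a", reward=0.5, s_next="b", alpha=0.1, gamma=0.9)
    print(delta, values)
```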

ten-year goals

Modified: .

tennis technique

Class 1 forehand grip racquet in right hand, as if picking it up from the ground. hand at bottom of handle. Grip the flat side of the…

Modified: .

tension

This page (first brainstormed in an Otter note) is for issues where I feel pulled in several directions. Different principles seem to yield…

Modified: February 10, 2022.

tensor

Everyone in machine learning talks about tensors, but no one really understands what they are. This page collects several definitions and…

Modified: July 18, 2022.

tensor product

The tensor product of two vector spaces (defined on the same scalar field, we'll assume ) is the vector space of formal sums of…

Modified: July 18, 2022.

terry tao on statistical mechanics

This post gives a nice, mathematically clear development of basic terms in statistical mechanics. Highlights: Think of a physical system as…

Modified: April 15, 2022.

testable prediction

Modified: .

thai ingredients

thai holy basil / hot basil; thai basil; turmeric; rice noodles: thin (pad thai) or wide (pad see ew / pad kee mao); oyster sauce, fish sauce…

Modified: May 16, 2022.

the appropriate kind of suffering

Modified: February 19, 2020.

the balanced-utility trap

Status: in conflict with [ negative utility ] ? See also: the [ hedonic treadmill ]. Evolutionarily, 'pain' exists to motivate you to get out…

Modified: March 20, 2022.

the best things have many stories

I used to think that there was a 'best' way to motivate an area. For example, in VI, the ELBO is derived from the KL divergence between a…

Modified: January 23, 2022.

the buddha solved his problem, now solve yours

[ Tucker Peck ] often mentions this as a thing Sharon Salzberg would say. What does it mean? I don't know - I should ask Tucker to clarify…

Modified: January 25, 2024.

the dance

It's almost never worth worrying about whether an individual action is the right thing to do. It's like trying to dance while worrying at…

Modified: May 16, 2022.

the discourse is wrong

In order for a group of people, like an academic field, or a political elite, to meaningfully converse about a complex topic, they have to…

Modified: February 25, 2022.

the map is not the territory

Metaphor connected to the observation that [ all models are wrong ]. Borges, On Exactitude in Science : ...In that Empire, the Art of…

Modified: .

the mind contains the world

A point made by [ Michael Taft ] in various talks, e.g. The World is Inside You (also the '[ emptiness ] of perception' described by [ Dan…

Modified: February 03, 2025.

the mistake is upstream

Comparing myself to SuccessfulFriend, I might be tempted to think that because he is interested in antitrust law, zoning reform, political…

Modified: August 15, 2020.

the null hypothesis is always wrong

Andrew Gelman believes that in certain areas of research , like the social sciences, everything is connected. "I’m not expressing…

Modified: June 08, 2021.

the privilege of advice working out

In every field, there is a store of 'standard' advice that is handed down from mentors to ambitious youngsters. In computer science grad…

Modified: March 06, 2020.

the purpose of life

The Feynmannian/Sagan/Tyson "scientific" view is that the [ purpose ] of life is understanding : the world is a giant mystery, with layers…

Modified: February 25, 2022.

the self is a construct

It exists, but is [ empty ], insubstantial, a [ fabrication ]. Foregrounding this view is an important part of [ awakening ] or…

Modified: March 23, 2023.

the system is bad

[ things are deeply wrong ]

Modified: August 27, 2021.

the system wants you to have ownership

If I'm managing someone, I want them to be coming up with their own ideas and owning them. Owning their ideas means they will themselves…

Modified: July 10, 2020.

theory of intelligence

tl;dr : the ideas we need to build intelligent systems may be different from those we need to understand them. Both are important, but…

Modified: February 26, 2022.

theory of the case

Several ideas here: When I try to tell a story about what I'd like to change about my life, at a high level, I can come at it from different…

Modified: May 07, 2020.

therapy

Modified: February 10, 2022.

there's never a single cause

For several reasons: multiple object-level causes; a telescoping tower of causes at increasing levels of generality or abstraction; 'because…

Modified: February 07, 2022.

there are no paradoxes, just bad models

If two statements that both seem true conflict with each other, then it seems like you have a paradox. But the world itself is just as it is…

Modified: January 23, 2022.

there is no speed limit

Modified: July 13, 2020.

theses are great sources

Pointed out in this tweet: https://twitter.com/AmandaAskell/status/1311776280128479238 but also in many other places over the years…

Modified: February 10, 2022.

things I believe that no one else believes

AI is going to work. Obviously lots of people believe this. But most 'AI' companies and 'AI' investors are hyping applications of current…

Modified: January 13, 2023.

things I will always do

No matter what other priorities or any incredibly important goals arise in my life, whether through work, family, or other circumstances…

Modified: February 23, 2020.

things I would like to do

Write Write regularly: under routine circumstances, at least a few minutes per day. This could be filling in nodes of this graph, blogging…

Modified: February 15, 2020.

things are deeply wrong

See also: [ the system is bad ] I find it hard to be okay with a 'normal' life, because that would imply some level of acceptance of the…

Modified: January 25, 2022.

things school should teach

Modified: February 10, 2022.

things that are always productive

These might not be the best thing to do at any point, but they're better than doing nothing. And doing them can create a sense of progress…

Modified: September 12, 2021.

things to build

See also [ writing inbox ]. See also [ ongoing projects ]. See, first and foremost, the backlinks below. Crypto trading model. Write a system…

Modified: November 13, 2021.

this is all there is

This is one of those things that sounds cliche but is still profound and #fundamental: this is all. There's no great reward in the future…

Modified: July 25, 2020.

thought vector

Modified: .

thoughts about kids

I want kids, eventually. I want to be able to talk with them, to build a relationship, to see the world through someone else's eyes. I want…

Modified: February 25, 2022.

thoughts are actions

The [ agent ] model of intelligence imposes a sharp distinction between the agent and its environment, where the agent 'chooses' actions…

Modified: June 27, 2021.

thoughts on multivariate causalimpact

let's say the signal we see after the intervention is modeled as the combination of the counterfactual forecast and an intervention effect…

Modified: February 15, 2022.

three characteristics

[ impermanence ] [ dukkha ] (unsatisfactoriness) [ anatta ] (no-self) Daniel Ingram's summary: things "come and go, don't satisfy…

Modified: May 19, 2022.

three questions

(fellow student) Smitha has these post-its on her desk: what are you doing? why is it important? are you making progress? I think these…

Modified: February 16, 2022.

tissue paper thin

This is a personal mental image for [ emptiness ] that has been really resonant for me, arising from an experience taking [ MDMA ] with a…

Modified: January 24, 2024.

to a first-time employee

It helps a lot to write down the things you think someone should know about working in a new environment. Even if a new person would figure…

Modified: September 07, 2020.

to those whom much is given, much is expected

I feel an obligation to try to do big things with my life, because I've had access to rare opportunities. If ten thousand randomly selected…

Modified: January 25, 2022.

to watch

Television: Arcane; Ken Burns on the Vietnam War; WandaVision; For All Mankind; Severance: https://m.imdb.com/title/tt11280740/; Borgen; Diplomat…

Modified: March 20, 2022.

tokenize

How should a machine learning model represent text? Word-level and character-level features are obvious options, but both have drawbacks…

Modified: February 13, 2023.

tool AI

Sometimes mentioned as a potential approach to [ AI safety ]. Gwern: Why Tool AIs want to be Agent AIs (roughly: because treating…

Modified: April 07, 2022.

toolformer

Notes on Toolformer: Language Models Can Teach Themselves to Use Tools The basic method is: "Given just a handful of human-written examples…

Modified: February 16, 2023.

trace

Trace of a Linear Operator We define the trace as the sum of diagonal elements of a matrix: Lemma : If and are square, then . Proof…

Modified: March 16, 2022.

tractable approximations to utilitarianism

There are three main approaches to moral philosophy: [ utilitarian ]ism: you should feed a starving person because it will increase 'global…

Modified: June 07, 2021.

training for consistency

These days we think a lot about using data to train large [ language model ]s. But there's only so much data in the world; eventually we'll…

Modified: October 27, 2022.

training researchers

I didn't have a good intuitive understanding of the social landscape of being a researcher (and joining a [ research community ]). When…

Modified: February 25, 2022.

transactions are positive-sum

If you and I agree of our own volition to exchange X for Y, this implies that we both believe we are gaining value in the trade. If one of…

Modified: February 22, 2022.

transformer

The core of the transformer architecture is multi-headed [ attention ]. The transformer block consists of a multi-headed attention layer…

Modified: February 13, 2023.

transformer parallelization math

What does the computational profile of a transformer vs a similar RNN look like? First, the transformer. Let's take the LLama 6.7B model…

Modified: October 04, 2023.

transformer primitives

In developing intuition about [ transformer ]s it's useful to think about specific primitive operations that can be implemented by a small…

Modified: February 13, 2023.

transformers with memory

Incorporating explicit memory and retrieval seems pretty clearly like the next frontier in language modeling and AI more broadly. We have…

Modified: September 03, 2022.

transposes are measures

According to this reddit post , one of the main takeaways of functional analysis is that the right way to interpret the 'transpose' of a…

Modified: November 06, 2020.

trapped priors

SSC link: https://www.astralcodexten.com/p/trapped-priors-as-a-basic-problem (2021 thoughts) How general is this phenomenon? You have a…

Modified: March 12, 2021.

trauma

Sasha Chapin describes trauma as a 'splitting off' of difficult or painful experiences as memories that the mind tries to avoid accessing…

Modified: October 27, 2024.

trituration

Modified: .

true but wrong

A pitfall with relying too heavily on rational deduction is that lots of logically 'true' conclusions are unimportant, or worse yet…

Modified: February 15, 2022.