24-06 Links
AI
How to create a web-scale dataset for training an LLM
Interestingly they stopped deduplicating as aggressively as they had been. Turns out the most unique things on the web are ads and other assorted crap.
Terence Tao interviewed about AI theorem provers
Analysis looking at when we might hit the data wall
Introducing Dream Machine - a next generation video model for creating high quality, realistic shots from text instructions and images using AI. It’s available to everyone today! Try for free here https://t.co/rBVWU50kTc #LumaDreamMachine pic.twitter.com/Ypmacd8E9z
— Luma AI (@LumaLabsAI) June 12, 2024
Shadow Workspace: Iterating on Code in the Background
Very cool blog by the Cursor folks showing some of the stuff they do behind the scenes. It's not just a VS Code fork with a built-in LLM chat window (though that is very useful on its own).
And it's basically free since you can cancel your Copilot subscription!
A theory of why Claude 3.5 Sonnet is insane at coding: mechanistic interpretability.
— Deedy (@deedydas) June 28, 2024
Anthropic showed that there are clever ways to understand what the weights of LLMs do and "steer" them to behave differently.
Doing this on Sonnet may be why it crushes it at code:
🧵
1/12 pic.twitter.com/JGR7Mr8xuq
*overhead internally at anthropic*
— Aidan McLau (@aidan_mclau) June 22, 2024
yeah so once we identified the <i'm really smart> feature, it was just a matter of turning it on, and we passed every llm without retraining https://t.co/G6Bc9mobg1
The Many Ways that Digital Minds Can Know
Yes. Word processing is bad.
— Christian Keil (@pronounced_kyle) June 29, 2024
Before it, document length was bounded by the human capacity to copy long-form text & about to plateau.
After word processing, the tax & legal codes grew without check. And became incomprehensible to anyone but career experts. https://t.co/YvQmmYGv5D pic.twitter.com/TCbEfRMONp
Time to reshare this:
AI is going to create 10x legal work, more lawyers. In the 70s you could do a multimillion $ deal on 15 pages because retyping was a pain in the ass.
— Preston Byrne (@prestonjbyrne) April 3, 2023
AI will allow us to cover the 1,000 most likely edge cases in the first draft and then the parties will argue over it for weeks. https://t.co/PnRyXuVQ7U
As surely as Water will wet us, as surely as Fire will burn, The Gods of the Copybook Headings with terror and slaughter return!
Building stuff
Unfortunately, this "parking lot in the CBD" phenomenon is the norm across most West African cities.
— Tonami Playman (@TonamiPlayman) June 4, 2024
Lagos's CBD on Lagos Island has an entire strip of surface parking. The office spaces in Victoria Island also are surrounded by parking. https://t.co/Gqf7qXDhmd pic.twitter.com/b1a3aZWQVQ
Eric Parry unveils rejigged designs for City of London’s tallest tower
I discovered the Thesis Driven newsletter this month. It's great:
Seven US Transit Projects Real Estate Investors Need to Know
New data suggests it could take nearly 20 years for S.F.’s struggling office market to recover
Less than 5 years ago, Salesforce (SF’s biggest employer) led the charge to pass Prop C which disproportionately affected financial services businesses. Today, Stripe, Block and Schwab have all left SF and Salesforce continues to dump office space. https://t.co/v8cAmxVRuP
— Cristina Cordova (@cjc) April 11, 2023
san francisco bureaucrats could have used any of the multiple payroll providers based in sf, but that would have meant a tech company makes money—so instead they burned $34M tax dollars on a custom system that doesn’t work https://t.co/tKxXajoPba
— Kane 謝凱堯 (@kane) March 1, 2024
10 Reasons Why Planning Lawyers Are So Busy (And Maybe Shouldn’t Be)
Ten new cycle routes completed across London
TfL and the boroughs’ continued work to develop Cycleways in London means the strategic cycle network has more than quadrupled in size from 90km in 2016 to 390km in June 2024.
Unveiling AVA: The Adaptable Footbridge and Lift System of the Future
AVA is faster to deliver and simpler to install compared to standard industry practices. The streamlined approval and design periods, coupled with the modular system featuring bolted connections, transform the assembly process. With internal cladding, glazing, and services installed before the bridge leaves the yard, only a single possession is required.
It's not particularly aesthetic, but it is functional, and it's cool to see real attention given to construction productivity and driving costs down.
101 things we now know about US housing markets
Ronan Lyons has started a newsletter looking at findings from his work on a historical dataset of US house prices.
First Universal theme park in Europe to generate '£50bn of economic benefits for UK'
Locally, not far from the proposed Bedfordshire site is the Harry Potter Experience at the Warner Bros studio tour near Watford, while there is Woburn Safari Park to the immediate north and Whipsnade Zoo to the immediate west of Luton.
Can Labour build 1.5 million homes?
300k a year isn't going to cut it anyway. Need to pump those numbers up!
Incredible policy failure...
Life east of the 5: sometime around 2020, California quietly passed a major milestone: there are now more of us living east of I-5 than west of it. 🏡📈
— Alfred Twu (@alfred_twu) June 20, 2024
EastCal is our future.
1/ pic.twitter.com/5a913DuS91
Science and other tech stuff
...One way to treat the symptom!
🌁 SF BAY AREA NETWORK UNVEIL | We’re excited to announce Archer’s planned air taxi network in the San Francisco Bay Area. This goal is to connect 5 key locations across the region: South San Francisco, Napa, San Jose, Oakland, and Livermore, replacing long drives with quick… pic.twitter.com/Z17B3tHtLR
— Archer (@ArcherAviation) June 20, 2024
History of nanopores and their use in DNA sequencing
Solving Atherosclerosis: The Small but Mighty Molecule
We’re continuing to engage with the MHRA. In fact, we have another scientific advice meeting in two weeks that we’ll be holding virtually. We’re excited to be working with the MHRA and hopefully doing part of our Phase 2 clinical trial in the UK.
To remind your readers, we were one of the first recipients of the UK’s ILAP program, the innovative licensing and access pathway, and that’s what really brought us to the UK. In addition to the good environment there, lots of collaborators, lots of innovation happening, especially in the imaging field in the UK.
The bad thing is that post Brexit, it seems that the MHRA has gotten a bit backlogged and isn’t able to keep up with our current demands on their time. It takes too long to get meetings and responses to applications currently. We’ve had to take our first human clinical trial to Australia, where it’s a faster, more streamlined, and cheaper process.
We are really excited to be working with some great people there. Stephen Nicholls, a world-renowned cardiologist, who we brought on as an advisor, has really helped pave the way and show us the ropes of how to navigate the system and get things going really fast in Australia. We think we’ll be able to efficiently get our trial done there.
Will We Ever Get Fusion Power?
And some AI fusion progress this month...
https://x.com/pfau/status/1800961530084643127
How AI Revolutionized Protein Science, but Didn’t End It
Join @elonmusk for a tour inside @SpaceX's Starbase and the brand new Starfactory. This video was shot the day before Flight 4, on June 5th, 2024. Part 2 comes out next week! pic.twitter.com/0YsIYelGq5
— Everyday Astronaut (@Erdayastronaut) June 22, 2024
London Underground hosts tests for ‘quantum compass’ that could replace GPS
The Untapped Potential of Geothermal Energy
Business / Finance / Management Stuff
High frequency trading is cool and good actually
How to hire low experience, high potential people
Managing Lockheed’s Skunk Works
How Alexa dropped the ball on being the top conversational system on the planet
— Mihail Eric (@mihail_eric) June 11, 2024
—
A few weeks ago OpenAI released GPT-4o ushering in a new standard for multimodal, conversational experiences with sophisticated reasoning capabilities.
Several days later, my good friends at PolyAI…
On Alpine's CEO training programme
https://www.bloomberg.com/opinion/articles/2024-06-24/boomer-candy-business-is-booming
Nice overview of the insurance industry. Lots of non-household names that keep it ticking along behind the scenes
People
New York or London — what’s your table talk style?
Hadn't heard the phrase "single table conversation" before. I think it could work well. But probably the responsibility of the host.
Is app-based dating ruining everything?
It's a genuine challenge going into dating mode cold. It can feel quite transactional and romance-less.
Miscellaneous
The successful capture and prosecution of prolific phone snatcher Sonny Stringer shows our focus on detecting and bringing to justice those who snatch phones.
— City of London Police (@CityPolice) June 5, 2024
PC Smith, who is trained in tactical pursuit & containment (TPAC), took the decision to make contact with his e-bike. pic.twitter.com/DWdDF3Aw5c
24 phones were found on him. How many previous convictions?
DNA Spray is a cool idea. It seems to be helping the City of London police.
Hadn't thought of this before. Couldn't it have been Apple cracking down on ad tracking though?
in this essay i will discuss how tiktok killed mobile gaming👾🎮 pic.twitter.com/vq58niiS5x
— andrew gao (@itsandrewgao) June 2, 2024
Some good advice in this talk for anyone who needs to work with clients or who aspires to write clearly
Alternative Greenwich pub recommendations from a friend, after I mentioned the Trafalgar Tavern:
The Old Brewery if the weather is good. It has infinite space (as it spills into the grass in Wren's ORNC), and the setting is actually considerably nicer than Trafalgar, looking over the UNESCO world heritage site and the river
Prince of Wales (Blackheath)
Hare & Billet (Blackheath)
Richard I (Greenwich)
The Cutty Sark (pub, Greenwich, a few mins further along the riverside from Trafalgar). Oldest pub in Greenwich and it shows!
Coach & Horses (inside the covered market)
Gipsy Moth, but it's completely generic
Guildford Arms is very nice with a big outside area
Why Is Everyone on Steroids Now?
Reminds me of this Palladium piece from a few years ago
Narrative violation: Almost everyone that applies for an O-1 visa to work in America actually ends up getting it
— Ankur Nagpal (@ankurnagpal) June 15, 2024
Latest approval rates are as high as 95%
If you're on the fence, and worried you are not "extraordinary" enough, let this be your sign to get over yourself & apply pic.twitter.com/NQ5QSvqzI3
I adore Jimmy Maher's Analogue Antiquarian blog. He's currently telling the story of Ferdinand Magellan
Remember, kids, Magellan's A LOT COOLER THAN JUSTIN BIEBER
Here’s the reupload of the Jones/FF video for yall since I had a little snafu with X @Support
— Pete Oxenham (@peteoxenham) June 24, 2024
Technical issues cannot defeat the human spirit!! pic.twitter.com/DNxvAN5k0h
There Is No Such Thing As Supply
But it will be no use asking, ‘What have I done to deserve this?’ The Straightener will reply: ‘But, my dear fellow, no one’s blaming you. We no longer believe in retributive justice. We’re healing you.’
Small drones will soon lose combat advantage, French Army chief says
Meanwhile...
China State Shipbuilding Corporation
Someone has put all of Paul Graham's essays on Github as an ePub
1329 pages (so far!)
Zepto, a 10-minute delivery app, raises at $3.6B valuation
Do the economics of this work in India in a way that they didn't here a few years ago? Cheaper labour, existing dabbawala network? Dunno. But it would be cool if VC-subsidised deliveries came back.