Yohei

Yohei

17,955 Twitter followers

Follow

GP @UntappedVC. Artist: @pixelbeastsnft, @animalbuildings. Build-in-public log: http://yohei.me

Posts

Anthropic making its own splash with an acquisition today $400M for Coefficient Bio, started last fall, developing an AI drug R&D platform

made a tool to compare star-to-LOC ratio on github cuz i thought i would do well, but @karpathy is 👑 any others i should try?

there are some really cool founders visiting SF from Japan to build their startups did my part this morning helping bridge US/Japan :) twitter.com/yoheinakajima/stat...

Everyone wants to plug an OpenClaw into their database. Nobody wants to explain to their CDO why they dropped their entire Snowflake at 2AM. We just shipped sandbox proxies. Every query the agent writes gets validated, proxied, and whitelisted before it touches anything. Go be the person who plugs an AI agent into prod and doesn't get fired. Create an API key. $100 free credits, $300 per org. Offer ends April 1st.

I ran a 35-billion parameter AI agent on a $600 Mac mini. Specs: M4 Mac-Mini 16GB RAM The model doesn't fit in RAM. It pages from the SSD at 30 tokens/second. On NVIDIA, the same paging gives you 1.6 tok/s. Apple Silicon gives you 30. That's 18.6x faster. No cloud. No API keys. $0/month. Here's what it can do 🧵

Which local models can actually handle tool calling? I built a framework to find out. 15 scenarios. 12 tools. Mocked responses. Temperature 0. No cherry-picking. Tested every Qwen3.5 size from 0.8B to 397B, and since some of you asked after the distillation tests: yes, I included Jackrong's Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled too. Only two models went all green: the 27B dense and the distilled 27B. The 397B? Failed two tests. The 122B? Failed one. The 35B? Failed two. The timed-out results — mostly on the smaller models, are cases where the model got stuck in a loop, repeating the same tool call until it hit the 30-second limit. The test that exposed the most models: "Search for Iceland's population, then calculate 2% of it." Simple, but 35B, 122B, and 397B all used a rounded number from memory instead of the actual search result. They didn't trust their own tool output. Small models hallucinate data. Big models ignore data. The 27B just threaded it through.

2 days left to apply for the fellowship! Applications close March 27th - we're giving grants to entrepreneurs and early access to Cofounder 2 + credits to a few motivated people to help them start companies. - Open internationally - Run digitally - Only catch is we get to do a case study if it works out, and that you give us feedback on the product! twitter.com/ndrewpignanelli/st...

while I agree with his general thinking about how we’ll rethink home design, seems optimistic to think that a meaningful percentage of homes would be redesigned in a 5-10 yr period

.@mcuban says humanoid robots won't last more than 5-10 years. Instead, we'll "design the house to fit the robot, and design the robot to fit the house." "You could create a house where the pantry, the refrigerator, and the washing machines were hidden behind the garage, if

it's time to drop three new #opensource robotic hands! this time with tactile sensors! Tweak it, 3D print it, and use them in your robotics and physical AI research! Here are some wild examples ↓↓↓

Manus ex-backend lead had a genius insight text based clis beat structured tool calling for ai agents all day because unix commands appear in training data going back to the 1970s text is the native language of the command line AND text is the native language of llms

Loading..