.@originalmaderix just got Apple’s Neural Engine to do training, not just inference. He reverse-engineered the private ANE stack and ran the forward pass + backprop directly on the ANE. (Apple normally only exposes the ANE through Core ML, with no public training API or docs for it.)
If this scales, local fine-tuning and always-on experimentation become cheap, quiet, and private.
It’s still an early PoC: a single-layer demo, the weight-gradient (dW) reduction and Adam step still fall back to the CPU, and it relies on private APIs that could break at any update. But the efficiency upside is real, super exciting!
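To make the hybrid split concrete, here’s a minimal numpy sketch of that training-step shape. This is NOT the actual reverse-engineered ANE code (those APIs are private and undocumented); it’s a hypothetical illustration where the matmul-heavy forward/backward passes stand in for what runs on the ANE, and the dW reduction + Adam update stay on the CPU, mirroring the PoC’s reported fallback:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 16))               # toy inputs
w_true = rng.normal(size=(16, 1))
y = X @ w_true                               # toy regression targets

W = np.zeros((16, 1))                        # single trainable layer, as in the demo
m = np.zeros_like(W)                         # Adam first-moment state (CPU-side)
v = np.zeros_like(W)                         # Adam second-moment state (CPU-side)
lr, b1, b2, eps = 1e-2, 0.9, 0.999, 1e-8

losses = []
for t in range(1, 201):
    # "ANE" portion (illustrative): forward pass + error computation
    pred = X @ W
    err = pred - y
    losses.append(float(np.mean(err ** 2)))

    # CPU-fallback portion: weight-gradient (dW) reduction + Adam step
    dW = 2.0 * X.T @ err / len(X)
    m = b1 * m + (1 - b1) * dW
    v = b2 * v + (1 - b2) * dW ** 2
    m_hat = m / (1 - b1 ** t)                # bias-corrected moments
    v_hat = v / (1 - b2 ** t)
    W -= lr * m_hat / (np.sqrt(v_hat) + eps)
```

The interesting engineering question is exactly this boundary: how much of the backward pass can be pushed onto the ANE before the remaining CPU work (optimizer state, reductions) dominates the step time.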