Published inThoughts and Reflections by Winston WangDeepSeek, Geopolitics, and the Real AI Battleground: Innovation, Talent, and Strategic AdvantageFeb 4Feb 4
Published inResearch Highlights by Winston WangC’mon, Stop Wowing! Let’s See Why the AI World Is Going Wild Over DeepSeek’s Latest R1!The shock brought to us by the Chinese company DeepSeek continues. Less than a month after the release of their V3 model, they introduced…Jan 26Jan 26
Published inPaper Skimming by Winston WangForget Transformers? Meet Titans, the Next Big Leap in AI Memory and EfficiencyThe transformer architecture faces numerous challenges, such as Mamba, linear recurrent models, and the recently revived Test-Time Training…Jan 23Jan 23
Published inResearch Highlights by Winston WangDeepSeek in a Nutshell: Everything you need to know at a GlanceRecently, the large language model (LLM) DeepSeek has gained widespread attention. Out of curiosity, I conducted a brief investigation to…Jan 6Jan 6
Published inPaper Skimming by Winston WangExploring Implicit Chain of Thought (ICoT): A Faster but Unproven Alternative to CoT ReasoningSince the release of OpenAI’s o1 model, Chain of Thought (CoT) prompting has gained widespread attention for its ability to enhance…Jan 2Jan 2
Sequoia’s Vision for AI in 2025: Key Predictions That Will Shape the FutureAs 2024, dubbed the “primordial soup year” for AI, draws to a close, the AIGC ecosystem has seen explosive growth across infrastructure…Jan 2Jan 2
Published inResearch Highlights by Winston WangAI’s Mathematical Myth Busted? FrontierMath Leaves LLMs Nearly Empty-HandedAlthough many large language models (LLMs) have recently demonstrated high scores in certain mathematical tests, growing research suggests…Jan 1Jan 1
Published inPaper Skimming by Winston WangEvaluating the Mathematical Reasoning Capabilities of Large Language Models: Limitations and…LLMs have made remarkable progress in various fields, including natural language processing, question answering, and creative tasks, even…Oct 15, 2024Oct 15, 2024
Published inResearch Highlights by Winston WangUnveiling AlphaFold 3: The Next Leap in Predicting Biomolecular Structures Across the Chemical…On October 9, 2024, the Royal Swedish Academy of Sciences decided to award half of the 2024 Nobel Prize in Chemistry to Demis Hassabis and…Oct 12, 2024Oct 12, 2024
Published inTutorial by Winston Wang在Apple Silicon上使用MLX对模型进行微调的新手教程目前市面上针对大语言模型微调的 python库,例如unsloth and lamini,在Apple M系列芯片上都不支持GPU加速。Apple Silicon芯片下的Mac使用MLX进行finetuning是一个很好的选择。Oct 2, 2024Oct 2, 2024