要和 DeepSeek 拚了！Meta 發表 Llama 4 AI 模型採多模態設計，具備 1,000 萬 token 處理能力

Meta 宣布推出全新 Llama 4 AI 系列模型，在 4/5 率先揭露 2 款模型：Llama 4 Scout 和 Llama 4 Maverick，並預告將推出另外 2 款尚在訓練階段的進階模型：Llama 4 Behemoth 與 Llama 4 Reasoning。

中國科技巨頭動作頻頻，刺激 Llama 進入高速研發階段

根據《Bloomberg》和《TechCrunch》報導，Meta 此舉被視為正面迎戰中國開源 AI 模型的發展。除了 DeepSeek 的開源 AI 模型，近期，中國多家科技巨頭加快推出新 AI 模型與應用，包括百度開放免費使用 Ernie Bot、阿里巴巴推出數款號稱超越 DeepSeek 的模型，騰訊更將 DeepSeek 整合進微信，都促使 Meta 的 Llama 研發進入高速發展階段。

Meta 的 Llama 4 標榜採用多模態設計，並首度採用混和專家架構（MoE）──這也是 DeepSeek 用來降低成本、提高效率的作法。Meta 指出，Scout 模型具備 170 億個有效參數與 16 個專家模型，具備極快的推理速度與 1,000 萬 token 的超長上下文處理能力（DeepSeek 可處理 token 數為 6.4 萬），能夠處理多文件摘要、大型程式碼推理等任務，並且可在單一 NVIDIA H100 GPU 運行。

Maverick 擁有 17 億個啟用參數與多達 128 個專家模型，擅長處理圖像與文字理解任務，適用於助理與對話等應用場景。Meta 表示，Maverick 在多項基準測試中超越 GPT-4o 與 Gemini 2.0，而在推理與程式寫程式能力上，與 DeepSeek v3.1 相媲美。

Meta 開發者大會即將到來，預料揭曉 2 兆參數新模型

Meta 預告，尚未發表的 Behemoth 模型，擁有 2,880 億個有效參數，總參數近兩兆，並在多項 STEM 領域表現超越 GPT-4.5、Claude 3.7 Sonnet 與 Gemini 2.0 Pro。Meta 執行長祖克柏表示，Behemoth 是「世界上最聰明的 LLM 之一」。

《Bloomberg》指出，Meta 的 AI 開發者大會將在幾週後到來，屆時將能聽到 Meta 更多有關 Behemoth 和推理模型的消息。

【推薦閱讀】

◆ AI 基礎設施要用雲端還是自己建？5 個企業管理者最大化投資效益的關鍵判斷

◆ Gemini 2.5 Pro 為企業解決「AI 黑盒子」問題，4 大特色挑戰 OpenAI、Claude

◆ 【工程師只是第一批】AI 職位替代會慢慢開始並突然爆發，企業如何因應？

＊本文部分初稿由 AI 生成，經《TechOrange》編撰，資料來源：Meta、《Bloomberg》、《Engadget》、《TechCrunch》，首圖來源：Meta。

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

要和 DeepSeek 拚了！Meta 發表 Llama 4 AI 模型採多模態設計，具備 1,000 萬 token 處理能力

中國科技巨頭動作頻頻，刺激 Llama 進入高速研發階段

Meta 開發者大會即將到來，預料揭曉 2 兆參數新模型

TO 會員電子報

AI Agent 進公司誰來管？Accenture 點名 HR 扛責，PwC 示警入門職缺「資深化」

攔截消費決策最起點：房產巨頭 Zillow 布局 NotebookLM，讓 AI 化身購屋族專屬軍師

Human-in-the-Loop 不再是黃金標準？亞馬遜揭 AI Agent 治理最大盲點

「3 成企業成功獲得 AI 投資回報，7 成企業尚未跨過應用門檻。」博弘雲端 Nextlink AI Solutions Day 與各領域專家共探零售業如何落實 AI Agent 商業價值