주제

#AI

450개 읽을거리를 모았습니다.

2026년 6월 27일

AWS Lambda MicroVMs - serverless sandboxes target AI-generated code execution

AWS는 2026년 6월 22일 Lambda MicroVMs를 발표하며, 사용자 또는 AI가 생성한 코드를 isolated, stateful execution environment에서 실행할 수 있는 serverless compute primitive를 공개했다. Firecracker 기반 VM-level isolation, near-instant launch/resume,…

개발도구

읽기

2026년 6월 27일

DeepSpec - speculative decoding becomes an open production optimization stack

DeepSeek은 speculative decoding draft model을 훈련하고 평가하기 위한 MIT-licensed DeepSpec repository를 공개했다. README 기준 DeepSpec은 data preparation, draft model implementation, training, evaluation scripts를 포함하며 DSpark, DFlash,…

오픈소스

읽기

2026년 6월 27일

GPT-5.6 Sol preview - frontier model releases become policy-gated infrastructure decisions

OpenAI는 2026년 6월 26일 GPT-5.6 series의 limited preview를 발표하며 Sol, Terra, Luna 3개 tier와 새로운 max reasoning effort, subagent 기반 ultra mode를 공개했다. Sol은 Terminal-Bench 2.1, GeneBench v1, ExploitBench, ExploitGym 같은 장시간…

OpenAI

읽기

2026년 6월 26일

General Intuition Series A - gameplay data becomes the next action-model training substrate

General Intuition은 Khosla Ventures가 lead한 3억 2천만 달러 Series A를 발표하며, 가상 및 물리 환경에서 perceive, predict, act할 수 있는 모델을 만들겠다고 밝혔다. 보도에 따르면 post-money valuation은 23억 달러이며, TechCrunch는 이번 라운드 이후 누적 공개 funding이 4억 5,400만 달러라고…

산업

읽기

2026년 6월 26일

HF Jobs vLLM server - throwaway OpenAI-compatible endpoints get pay-per-second GPUs

Hugging Face는 HF Jobs에서 vLLM server를 한 번의 CLI 명령으로 띄워 private OpenAI-compatible LLM endpoint를 만들 수 있는 흐름을 공개했다. 서버 프로비저닝이나 Kubernetes 없이 pay-per-second GPU 인프라에서 테스트, eval, batch generation 용도로 빠르게 사용할 수 있다는 점을 전면에 내세웠다.

개발도구

읽기

2026년 6월 26일

QHexRT - Qualcomm Hexagon NPU inference moves small LLMs fully on-device

RunAnywhereAI는 Qualcomm Hexagon NPU용 full-stack inference engine인 QHexRT를 공개했고, 첫 catalog entry로 Liquid AI의 LFM 2.5 230M을 지원한다. 발표는 decode graph, prefill graph, lm-head, embeddings까지 inference path의 모든 tensor가 HTP에 머무르며…

개발도구

읽기

2026년 6월 25일

Claude Tag - Slack-native team agents move from private assistants to shared workspaces

Anthropic은 Slack에서 @Claude를 태그해 팀 단위로 작업을 위임하는 Claude Tag beta를 공개했다. Claude Enterprise와 Team 고객 대상이며, channel-scoped memory, tool/data/codebase access, ambient updates, spend limits, activity logs를 제공한다.

Claude

읽기

2026년 6월 25일

GLM-5.2 - open long-context models push agentic coding toward 1M-token workspaces

Z.AI는 GLM-5.2를 공개하며 1M-token context, flexible effort levels, MIT license, long-horizon coding benchmark 성능을 전면에 내세웠다. 공개 글은 IndexShare로 1M context에서 per-token FLOPs를 2.9x 줄이고, Terminal Bench 2.1 81.0, SWE-bench Pro…

모델

읽기

2026년 6월 25일

Microsoft AutoJack - browsing agents expose local MCP control planes to RCE

Microsoft Defender Security Research Team은 AutoGen Studio 개발 빌드에서 browsing agent가 악성 웹페이지를 렌더링하면 local MCP WebSocket을 통해 host process를 실행할 수 있는 AutoJack chain을 공개했다. 이 chain은 localhost origin trust, MCP path auth…

개발도구

읽기

2026년 6월 24일

FFASR Leaderboard - voice AI benchmarks move from clean speech to far-field reality

Hugging Face와 Treble Technologies는 Far-Field ASR(FFASR) Leaderboard를 공개해 ASR 모델을 reverberation, background noise, competing speech, room acoustics 같은 실제 far-field 조건에서 비교할 수 있게 했다. 기존 clean/near-field benchmark 중심 평가가…

모델

읽기

2026년 6월 24일

Kog Laneformer 2B - latency-first coding models move architecture into the serving layer

Kog는 Hugging Face에 Laneformer 2B의 weights와 model code를 공개했다. 이 모델은 2.3B parameter instruction-tuned coding model로, Delayed Tensor Parallelism과 lane-structured Transformer를 통해 batch-size-one decoding latency를 모델 아키텍처…

모델

읽기

2026년 6월 24일

Krea 2 technical report - open image models compete on creative control, not only fidelity

Krea는 Krea 2 technical report를 공개하며 K2 Raw와 K2 Turbo 계열의 open-weights text-to-image foundation model을 설명했다. 보고서는 data curation, diffusion transformer architecture, multi-stage training, prompt expander, style-reference…

모델

읽기

2026년 6월 24일

NVIDIA NeMo AutoModel - MoE fine-tuning gets a drop-in performance path for Transformers

NVIDIA와 Hugging Face는 Transformers v5 위에서 NeMo AutoModel을 사용해 MoE fine-tuning을 가속하는 방법을 공개했다. NeMo AutoModel은 Expert Parallelism, DeepEP fused all-to-all dispatch, TransformerEngine kernels를 추가해 같은 from_pretrained() 계열…

개발도구

읽기

2026년 6월 23일

Fika Jobs - AI interview agents expose the product-risk tradeoff in hiring automation

TechCrunch는 Stockholm 기반 Fika Jobs가 AI interview agents와 short-form video profiles를 결합한 hiring platform으로 400만 달러 pre-seed를 유치했다고 보도했다. 후보자는 LinkedIn profile을 연결하고 Gemini 기반 agent가 생성한 약 10분 interview를 수행하며, Fika는 이를 짧은…

에이전트

읽기

2026년 6월 23일

Google Jules evals - coding agents need insight-policy benchmarks, not just SWE-bench tasks

Google Developers Blog는 Jules 연구를 통해 proactive coding agent 평가가 단일 bug fix 성공률이 아니라 insight policy를 측정해야 한다고 주장했다. 내부 Google codebase의 705 bugs와 1,178 CLs를 이용해 related bug cluster를 aspirational goal로 재구성하고, agent가 3회…

개발도구

읽기

2026년 6월 23일

huggingface_hub weekly release CI - open-weight agents make release automation auditable

Hugging Face는 huggingface_hub를 4-6주 주기에서 weekly release로 바꾼 GitHub Actions 기반 release pipeline을 공개했다. OpenCode, GLM-5.2 open-weight model, HF Inference Providers, PyPI Trusted Publishing을 사용하되, release notes와…

오픈소스

읽기

2026년 6월 23일

OpenAI Patch the Planet - AI-assisted security needs maintainer-controlled remediation loops

OpenAI는 Trail of Bits와 함께 Patch the Planet을 공개해 cURL, NATS Server, pyca/cryptography, Sigstore, aiohttp, Go, Python 등 주요 OSS 프로젝트에 AI-assisted security research와 human expert review를 결합한다. Daybreak/Codex Security 흐름은…

OpenAI

읽기

2026년 6월 22일

Intel XPU Kernel Skill - coding agents optimize Triton kernels beyond CUDA-first defaults

Intel DCG AI Software와 OCTO Parallel Computing Lab은 Hugging Face Kernel Hub용 Intel XPU Kernel Skill을 공개했다. Xe-Forge 기반 CoVeR loop는 LLM이 Triton kernel을 생성, 검증, benchmark, refine하도록 만들며 Arc Pro B70에서 KernelBench Level-2…

개발도구

읽기

2026년 6월 22일

MosaicLeaks - deep research agents can leak private facts through harmless-looking searches

ServiceNow와 Hugging Face는 deep research agent가 private local documents와 web retrieval을 함께 사용할 때 외부 검색 쿼리만으로 민감 정보가 새는 MosaicLeaks 문제를 제시했다. 제안한 PA-DR training은 strict chain success를 48.7%에서 58.7%로 올리면서…

에이전트

읽기

2026년 6월 22일

PP-OCRv6 on Hugging Face - document AI stays specialized, small, and multilingual

PaddlePaddle은 Hugging Face에서 PP-OCRv6를 공개하며 1.5M, 7.7M, 34.5M parameter의 tiny/small/medium OCR tier를 제공한다고 밝혔다. medium/small tier는 50개 언어를 지원하고, medium은 자체 multi-scenario benchmark에서 detection Hmean 86.2%, recognition…

모델

읽기

2026년 6월 22일

Reflection-SpaceX compute deal - open-source frontier AI hits a capacity wall

Nvidia-backed Reflection AI가 SpaceXAI의 Colossus 2 compute에 접근하는 대형 계약을 체결한 것으로 보도됐다. 계약 구조는 2026년 7월 1일부터 2029년까지 월 1.5억 달러, 총 약 USD 6.3B 규모로 알려졌고, Reflection은 GB300급 compute를 확보해 open-source frontier model 경쟁을 이어가려 한다.

산업

읽기

2026년 6월 21일

Arcade Series A — enterprise agents need an authorization layer, not just MCP gateways

Arcade.dev는 SYN Ventures 주도, Morgan Stanley와 Wipro 참여로 6,000만 달러 Series A를 유치해 누적 7,200만 달러를 확보했다고 발표했다. 회사는 production AI agent를 위한 secure action layer를 표방하며 authorization, reliability, governance를 핵심 문제로 제시한다.

에이전트

읽기

2026년 6월 21일

Cloudflare Temporary Accounts — coding agents can deploy Workers without human signup flow

Cloudflare는 2026년 6월 19일 AI agents가 wrangler deploy --temporary로 계정 생성, OAuth, API token 발급 없이 Workers를 배포할 수 있는 Temporary Accounts 기능을 공개했다. 배포된 Worker는 60분 동안 유지되며, 사용자가 claim하면 영구 계정으로 전환할 수 있다.

에이전트

읽기

2026년 6월 21일

GitHub Code Quality GA — code governance becomes subscription plus AI metering

GitHub는 Code Quality가 2026년 7월 20일 public preview에서 GA로 전환되며 유료 제품이 된다고 공지했다. 가격은 enabled repository의 active committer당 월 10달러에 AI-powered 기능 사용량 과금이 추가되고, deterministic CodeQL 분석은 GitHub Actions minutes를 소비한다.

개발도구

읽기

2026년 6월 21일

NVIDIA Cannes AI marketing stack — agentic workflows move into campaign operations

NVIDIA는 Cannes Lions 2026 기간 Alembic, AWS, Criteo, Higgsfield, KERV.ai, Taboola 등이 NVIDIA infrastructure와 agent toolkit으로 광고·마케팅 AI를 운영 사례로 시연한다고 밝혔다. 사례에는 Criteo의 Blackwell 기반 약 2배 학습 속도 개선과 연 17,000 GPU hours 절감,…

산업

읽기

2026년 6월 20일

Adani-Jabil AI infra alliance — AI 경쟁이 모델에서 전력·랙·제조 공급망으로 확장된다

Adani Group과 Jabil은 2026년 6월 15일 India에 vertically integrated AI and data center infrastructure manufacturing platform을 만들기 위한 strategic alliance intent를 발표했다. 목표는 multi-GW high-density AI rack, liquid-cooled server,…

산업

읽기

2026년 6월 20일

Norway school AI restrictions — 초등 AI 금지가 교육용 AI 확산의 반작용을 보여준다

Reuters 보도에 따르면 Norway는 2026년 8월 새 학기부터 1~7학년(6~13세)의 generative AI 사용을 원칙적으로 금지하고, 14~16세는 교사 감독 아래 제한적으로 허용한다.

#AI

AWS Lambda MicroVMs - serverless sandboxes target AI-generated code execution

DeepSpec - speculative decoding becomes an open production optimization stack

GPT-5.6 Sol preview - frontier model releases become policy-gated infrastructure decisions

General Intuition Series A - gameplay data becomes the next action-model training substrate

HF Jobs vLLM server - throwaway OpenAI-compatible endpoints get pay-per-second GPUs

QHexRT - Qualcomm Hexagon NPU inference moves small LLMs fully on-device

Claude Tag - Slack-native team agents move from private assistants to shared workspaces

GLM-5.2 - open long-context models push agentic coding toward 1M-token workspaces

Microsoft AutoJack - browsing agents expose local MCP control planes to RCE

FFASR Leaderboard - voice AI benchmarks move from clean speech to far-field reality

Kog Laneformer 2B - latency-first coding models move architecture into the serving layer

Krea 2 technical report - open image models compete on creative control, not only fidelity

NVIDIA NeMo AutoModel - MoE fine-tuning gets a drop-in performance path for Transformers

Fika Jobs - AI interview agents expose the product-risk tradeoff in hiring automation

Google Jules evals - coding agents need insight-policy benchmarks, not just SWE-bench tasks

huggingface_hub weekly release CI - open-weight agents make release automation auditable

OpenAI Patch the Planet - AI-assisted security needs maintainer-controlled remediation loops

Intel XPU Kernel Skill - coding agents optimize Triton kernels beyond CUDA-first defaults

MosaicLeaks - deep research agents can leak private facts through harmless-looking searches

PP-OCRv6 on Hugging Face - document AI stays specialized, small, and multilingual

Reflection-SpaceX compute deal - open-source frontier AI hits a capacity wall

Arcade Series A — enterprise agents need an authorization layer, not just MCP gateways

Cloudflare Temporary Accounts — coding agents can deploy Workers without human signup flow

GitHub Code Quality GA — code governance becomes subscription plus AI metering

NVIDIA Cannes AI marketing stack — agentic workflows move into campaign operations

Adani-Jabil AI infra alliance — AI 경쟁이 모델에서 전력·랙·제조 공급망으로 확장된다

Norway school AI restrictions — 초등 AI 금지가 교육용 AI 확산의 반작용을 보여준다

Salesforce-Fin acquisition — customer service agents가 CRM suite의 핵심 실행 계층으로 편입된다

Anthropic Public Record — 미국 대중은 AI 효용보다 책임성과 규제를 먼저 요구한다

ChatGPT Enterprise spend controls — AI 도입의 병목이 모델 접근에서 비용 거버넌스로 이동

MAI-Code-1-Flash 확장 — coding model 경쟁이 Copilot surface coverage로 이동

OpenAI AI chemist — GPT-5.4가 자동화 실험실과 결합해 Chan-Lam 수율을 개선

Google UCP open rails — agentic commerce가 쇼핑 UI에서 표준 프로토콜 경쟁으로 이동

OpenAI June 2026 Threat Report — AI 논쟁 자체가 영향공작 표적이 됐다

Probably $9M seed — AI 신뢰성 경쟁이 더 큰 모델에서 deterministic harness engineering으로 이동

Google Colab CLI — agent-ready compute가 로컬 터미널에서 즉시 GPU·TPU orchestration으로 이동

OpenEnv committee launch — open agent training이 harness별 튜닝에서 공유 environment protocol로 이동

Prometheus $12B Series B — industrial AI가 chatbot에서 physical engineering cycle compression으로 이동

AI brands as bait — AI 열풍이 모델 출시 경쟁에서 social engineering 공격면 확대로 번지다

GitHub Agentic Workflows public preview — 에이전트 자동화가 YAML 작성에서 policy-aware SDLC 실행 계층으로 이동

GitHub Copilot CLI + language servers — AI 코딩이 text grep에서 semantic code intelligence 단계로 이동

AI in the Enterprise: How People Use M365 Copilot Chat — enterprise AI 채택이 검색 보조에서 문서·커뮤니케이션 작업으로 이동

Cloudflare acquires VoidZero — AI 코딩 시대의 배포 스택이 framework 선택에서 execution path 통합으로 이동

OpenRouter·Concentrate AI 부상 — LLM 경쟁이 모델 성능에서 routing economics 계층으로 이동

Claude Fable 5 — frontier model 공개가 capability race에서 guardrailed deployment 경쟁으로 이동

OpenAI S-1 confidential filing — AI 경쟁이 모델·제품 전쟁에서 자본시장 체력전으로 이동

Salesforce Agentforce layoffs — enterprise AI가 성장 서사에서 조직 재편과 제품 현실성 검증 단계로 이동

Dreaming: Better memory for a more helpful ChatGPT — AI personal memory가 saved note에서 지속적 user model로 전환

ECB AI risk letter — 금융권 AI 도입이 pilot enthusiasm에서 board-level defensive posture로 이동

Introducing Mellum2 — software engineering용 small expert model 경쟁이 giant general model에서 low-latency control layer로 이동

US House AI draft bill — 미국 AI 규제 경쟁이 state patchwork에서 federal model-development preemption으로 이동

IBM-Google Cloud Practice — enterprise agent 도입이 PoC에서 서비스 채널과 산업별 delivery asset 경쟁으로 이동

Ollama 0.30 — local AI 배포 경쟁이 모델 자체에서 runtime 호환성과 GPU 보편성으로 이동

WWDC26 Apple Intelligence APIs — on-device model access가 앱 기능에서 workflow substrate로 확장

IBM and Red Hat Project Lightwell — open source AI 시대의 공급망 보안이 clearinghouse 모델로 재편

Introducing Gemma 4 12B — local multimodal agent 실행이 16GB급 엣지 하드웨어로 내려오다

Meta Business Agent — customer support agent가 CRM 플러그인에서 메시징-native 운영 계층으로 확장

Protecting against token theft — AI endpoint 보안이 인증에서 per-request 경제성 방어로 이동

GitHub Copilot usage-based billing — AI 코딩 도구 경쟁이 모델 품질에서 token economics와 admin control로 이동

Intel Xeon 6+ — agentic AI 인프라 병목이 GPU 단일 경쟁에서 orchestration CPU·memory·network 균형으로 이동

Supabase Series F — vibe coding이 backend를 demo layer에서 agentic production substrate로 밀어올리다

Palo Alto Frontier AI Defense — AI 보안이 모델 평가에서 machine-speed 대응 체계로 이동

Redis Iris — agent stack이 prompt tuning에서 context engine 아키텍처로 이동

SAP sustainability AI agents — enterprise AI가 챗봇에서 규제 워크플로 자동화로 이동

Snowflake acquires Natoma — MCP가 실험적 연결 규약에서 enterprise governance layer로 이동

Coralogix 200M Series F: AI agent observability가 독립 인프라 카테고리로 부상

Postman AI Engineer: API 조직이 context debt를 관리하는 agentic engineering 계층

Workday DevCon 2026: enterprise agent가 HR·Finance system of record로 진입하는 검증 스택 공개

Build 2026: Microsoft가 Windows를 local agent runtime으로 전환

Cisco Cloud Control: IT 운영이 dashboard에서 agentic control plane으로 이동

Codex for every role, tool, and workflow — 코딩 에이전트가 팀 업무 플랫폼으로 확장

미국 AI 행정명령: frontier model 정책이 보안 운영 체계로 구체화

Introducing Command A+ — sovereign enterprise AI가 폐쇄형 API 의존에서 배포 가능한 open model stack으로 이동

NVIDIA Alpamayo 2 Super — autonomous driving이 perception stack에서 reasoning-first physical AI stack으로 이동

Salesforce acquires Contentful — enterprise AI가 CRM assistant에서 content orchestration layer 통합으로 이동

AWS launches Amazon Quick desktop AI assistant that works across your applications, tools, and data

Enhanced AI Management and Analytics for Organizations

Introducing Trusted Remote Execution: Policy-Enforced Scripts for AI Agents and Humans

TeamCity 2026.1: CLI, MCP for AI Agents, Pipelines Enhancements, and More