Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value cache by 20x without model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x for multi-turn AI applications.
This release suits developers building long-context applications or real-time reasoning agents, and anyone looking to reduce GPU costs in high-volume production environments.
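To put the 20x figure in perspective, a back-of-the-envelope estimate of KV-cache memory helps. The model dimensions below (layer count, KV heads, head size) are illustrative assumptions for a large model, not details of KVTC itself:

```python
def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, batch, bytes_per_elem=2):
    """Rough KV-cache size: 2 tensors (K and V) per layer, fp16 = 2 bytes/element."""
    return 2 * layers * kv_heads * head_dim * seq_len * batch * bytes_per_elem

# Hypothetical 70B-class model: 80 layers, 8 KV heads (grouped-query attention),
# head_dim 128, serving a single 128k-token context.
raw = kv_cache_bytes(layers=80, kv_heads=8, head_dim=128, seq_len=128_000, batch=1)
compressed = raw / 20  # applying the 20x compression ratio cited for KVTC

print(f"raw: {raw / 2**30:.1f} GiB, at 20x: {compressed / 2**30:.2f} GiB")
# → raw: 39.1 GiB, at 20x: 1.95 GiB
```

Under these assumptions, a cache that would otherwise dominate a single GPU's memory shrinks to a couple of gigabytes, which is what makes caching many concurrent multi-turn sessions practical.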
The world's first Tibetan large language model and its application, DeepZang, have been officially unveiled in Lhasa, ...
Whether you are looking for an LLM with more safety guardrails or one completely without them, someone has probably built it.
Dozens of Telegram channels reviewed by WIRED include job listings for “AI face models.” The (mostly) women who land these ...
A Stanford engineer has demonstrated that frontier language models can run directly on everyday edge devices using convex ...
Touting its status as the “world’s largest contributor to open-source AI,” Nvidia Corp. is doubling down on open artificial ...
HONG KONG and SHANGHAI, March 15, 2026 /PRNewswire/ -- Ping An Insurance (Group) Company of China, Ltd. ("Ping An" or "the Group"; HKEX: 2318/82318; SSE: 601318) announced that PingAnGPT-Qwen3-32B, ...
OpenAI Group PBC and Mistral AI SAS today introduced new artificial intelligence models optimized for cost-sensitive use cases. OpenAI is rolling out two models called GPT-5.4 mini and GPT-5.4 ...
Meta’s recently acquired AI startup Manus has launched a desktop app for Mac and Windows. It features an agentic tool called ...
SINGAPORE, March 20, 2026 /EINPresswire.com/ -- As we navigate the sophisticated landscape of ...
Cognitive warfare technologies now model and simulate human behavior at scale, raising concerns about autonomous digital ...