Semantic Length Limits in LLM Based Steganography
DOI:
https://doi.org/10.32473/flairs.39.1.141579Abstract
The Calgacus protocol enables LLM-based steganography through rank-based token encoding, but its operational length limits remain poorly characterized. We conduct 2,600 encoding trials across 10–500 tokens using 10 distinct key-prefix scenarios. Breakdown thresholds vary 22.5-fold (20 to 450 tokens) depending solely on scenario selection, demonstrating that length limits are semantic rather than technical. Rank statistics predict robustness, with low-rank scenarios (mean rank <25) supporting substantially longer messages. These findings expose security risks; adversaries with optimized key-prefix pairs can transmit messages 20× longer than theoretical constraints suggest, fundamentally altering threat models for LLM-mediated covert channels.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Haley Stinebrickner, Alexander V. Mantzaris, Wissam Ghantous

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.