Format-based Watermarking
- Electronic marking and identification techniques to discourage document copying.
- UniSpaCh: A text-based data hiding method using Unicode space characters.
- Content-preserving text watermarking through unicode homoglyph substitution.
- Embarrassingly simple text watermarks.
Lexical-based Watermarking
- Deeptextmark: Deep learning based text watermarking for detection of large language model generated text.
- The hiding virtues of ambiguity: quantifiably resilient watermarking of natural language text through synonym substitutions.
- Watermarking text generated by black-box language models.
- Robust multi-bit natural language watermarking through invariant features.
Syntactic-based Watermarking
- Natural language watermarking: Design, analysis, and a proof-of-concept implementation.
- Natural language watermarking via morphosyntactic alterations.
- Words are not enough: sentence level natural language watermarking.
Generation-based Watermarking
- Adversarial watermarking transformer: Towards tracing text provenance with data hiding.
- Waterfall: Framework for Robust and Scalable Text Watermarking.
- REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models.
- PostMark: A Robust Blackbox Watermark for Large Language Models. (Future)