A Family of LZ78-based Universal Sequential Probability Assignments

Sagan, Naomi; Weissman, Tsachy

Computer Science > Information Theory

arXiv:2410.06589 (cs)

[Submitted on 9 Oct 2024]

Title:A Family of LZ78-based Universal Sequential Probability Assignments

Authors:Naomi Sagan, Tsachy Weissman

View PDF HTML (experimental)

Abstract:We propose and study a family of universal sequential probability assignments on individual sequences, based on the incremental parsing procedure of the Lempel-Ziv (LZ78) compression algorithm. We show that the normalized log loss under any of these models converges to the normalized LZ78 codelength, uniformly over all individual sequences. To establish the universality of these models, we consolidate a set of results from the literature relating finite-state compressibility to optimal log-loss under Markovian and finite-state models. We also consider some theoretical and computational properties of these models when viewed as probabilistic sources. Finally, we present experimental results showcasing the potential benefit of using this family -- as models and as sources -- for compression, generation, and classification.

Comments:	31 pages, 5 figures, submitted to IEEE Transactions on Information Theory
Subjects:	Information Theory (cs.IT)
MSC classes:	94A12 (Primary), 94A29, 94A17 (Secondary)
ACM classes:	H.1.1
Cite as:	arXiv:2410.06589 [cs.IT]
	(or arXiv:2410.06589v1 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.2410.06589

Submission history

From: Naomi Sagan [view email]
[v1] Wed, 9 Oct 2024 06:39:22 UTC (375 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.IT

< prev | next >

new | recent | 2024-10

Change to browse by:

cs
math
math.IT

References & Citations

export BibTeX citation

Computer Science > Information Theory

Title:A Family of LZ78-based Universal Sequential Probability Assignments

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:A Family of LZ78-based Universal Sequential Probability Assignments

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators