Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Lee, Jaewon, Eimon, Md Eimran Hossain, Srinivasan, Avinash, Kalva, Hari
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2604.11010
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866915934010081280
author	Lee, Jaewon Eimon, Md Eimran Hossain Srinivasan, Avinash Kalva, Hari
author_facet	Lee, Jaewon Eimon, Md Eimran Hossain Srinivasan, Avinash Kalva, Hari
contents	Digital forensic investigations often face significant challenges when recovering fragmented multimedia files that lack file system metadata. While traditional file carving relies on signatures and discriminative deep learning models for fragment classification, these methods cannot reconstruct or predict missing data. We propose a generative approach to multimedia carving using bGPT, a byte-level transformer designed for next-byte prediction. By feeding partial BMP image data into the model, we simulate the generation of likely fragment continuations. We evaluate the fidelity of these predictions using different metrics, namely, cosine similarity, structural similarity index (SSIM), chi-square distance, and Jensen-Shannon divergence (JSD). Our findings demonstrate that generative models can effectively predict byte-level patterns to support fragment matching in unallocated disk space.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_11010
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Byte-level generative predictions for forensics multimedia carving Lee, Jaewon Eimon, Md Eimran Hossain Srinivasan, Avinash Kalva, Hari Computer Vision and Pattern Recognition Digital forensic investigations often face significant challenges when recovering fragmented multimedia files that lack file system metadata. While traditional file carving relies on signatures and discriminative deep learning models for fragment classification, these methods cannot reconstruct or predict missing data. We propose a generative approach to multimedia carving using bGPT, a byte-level transformer designed for next-byte prediction. By feeding partial BMP image data into the model, we simulate the generation of likely fragment continuations. We evaluate the fidelity of these predictions using different metrics, namely, cosine similarity, structural similarity index (SSIM), chi-square distance, and Jensen-Shannon divergence (JSD). Our findings demonstrate that generative models can effectively predict byte-level patterns to support fragment matching in unallocated disk space.
title	Byte-level generative predictions for forensics multimedia carving
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2604.11010

Ähnliche Einträge