Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MKV audio extract fail with Segfault #4

Open
VigibotDev opened this issue Feb 13, 2024 · 3 comments
Open

MKV audio extract fail with Segfault #4

VigibotDev opened this issue Feb 13, 2024 · 3 comments

Comments

@VigibotDev
Copy link

Hello, I need to deduplicate video file with many codec like AC3 5.1 / EAC3 5.1 and more than one audio track.
I think I need to write the ffmpeg part myself to downscale and extract all audio stream as a pcm_s16le ?

Else it always end with segfault :

...
    Stream #0:0(fre): Audio: pcm_s16le ([1][0][0][0] / 0x0001), 44100 Hz, 5.1(side), s16, 4233 kb/s (default)
    Metadata:
      title           : Français VFI AC3 5.1 @ 448 Kbps
      BPS             : 448000
      BPS-eng         : 448000
      DURATION        : 02:28:51.680000000
      DURATION-eng    : 02:28:51.680000000
      NUMBER_OF_FRAMES: 279115
      NUMBER_OF_FRAMES-eng: 279115
      NUMBER_OF_BYTES : 500174080
      NUMBER_OF_BYTES-eng: 500174080
      _STATISTICS_WRITING_APP: mkvmerge v7.9.0 ('Birds') 64bit
      _STATISTICS_WRITING_APP-eng: mkvmerge v7.9.0 ('Birds') 64bit
      _STATISTICS_WRITING_DATE_UTC: 2015-11-04 17:32:41
      _STATISTICS_WRITING_DATE_UTC-eng: 2015-11-04 17:32:41
      _STATISTICS_TAGS: BPS DURATION NUMBER_OF_FRAMES NUMBER_OF_BYTES
      _STATISTICS_TAGS-eng: BPS DURATION NUMBER_OF_FRAMES NUMBER_OF_BYTES
      encoder         : Lavc58.91.100 pcm_s16le
[wav @ 0x558a274d1cc0] Filesize 4726645260 invalid for wav, output file will be broken
size= 4615865kB time=02:28:51.68 bitrate=4233.6kbits/s speed= 310x
video:0kB audio:4615864kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.000004%
/tmp/Test movie title.mkv

Test movie title

Erreur de segmentation
@gurushida
Copy link
Owner

The problem here seems to be that the input file is long enough to produce a wav file that will be larger than 4 gigabytes (4726645260 bytes), which is not supported by the wav format (see https://superuser.com/a/1523649). Unless you are willing to implement support for the RF64 header that would allow to support such larges files, the solution would be to split your input file in shorter clips and index each clip individually.

@VigibotDev
Copy link
Author

VigibotDev commented Feb 15, 2024

The problem here seems to be that the input file is long enough to produce a wav file that will be larger than 4 gigabytes (4726645260 bytes), which is not supported by the wav format (see https://superuser.com/a/1523649). Unless you are willing to implement support for the RF64 header that would allow to support such larges files, the solution would be to split your input file in shorter clips and index each clip individually.

Very interesting. I tried Olaf in advanced maneer (only the C binary usage) but the performance not match my search. I look for audio fingerprint permit the detection of a duplicate sound track in a collection of 24198+ videos.
N * N = 585543204 comparison (minus 1xN for the video itself). but I need a similarities percentage for each N*N result to make interesting data....

@VigibotDev
Copy link
Author

I need complete audio (about 2 hour duration average) fingerprint to get similarity factor or percentage for N*N.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants