Thanks for your reply, Sami.
I think your formula is not correct: instead of 2^24 it should probably be 24, since 24 bits (3 bytes) are sampled 48000 times per second. That gives 3 x 48000 x 60 = ~8.6MB for 1 minute (per channel), which makes sense since an audio CD is ~650MB for 44100Hz/16Bit/Stereo. I don't think memory is the issue in this case...
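
To double-check the arithmetic, here is a quick sketch in Python. The function name `pcm_bytes` is just something I made up, and I'm assuming uncompressed PCM and a single channel for the 24-bit/48000Hz case, since the channel count wasn't stated:

```python
def pcm_bytes(sample_rate_hz, bits_per_sample, channels, seconds):
    """Size in bytes of uncompressed PCM audio (no header, no compression)."""
    # bits per second = rate * bit depth * channels; divide by 8 for bytes
    return sample_rate_hz * bits_per_sample * channels * seconds // 8


# 24-bit samples at 48000 Hz, one channel (assumed), one minute
one_minute_24bit = pcm_bytes(48_000, 24, 1, 60)
print(one_minute_24bit)  # 8640000 bytes, i.e. about 8.6 MB

# CD-quality audio for comparison: 44100 Hz, 16 bit, stereo, one minute
one_minute_cd = pcm_bytes(44_100, 16, 2, 60)
print(one_minute_cd)  # 10584000 bytes, i.e. about 10.6 MB
```

So the bit depth enters the size as a plain factor of 24 bits (3 bytes) per sample, not as 2^24. With two channels the 24-bit figure simply doubles, which is still small.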