Fine-Tuning Audio Compression: Algorithmic Implementation and Performance Metrics
Keywords:
Audio Compression, Algorithm Evaluation, MP3 Compression, LPC Compression, Wavelet Compression, Subband Compression, Performance Metrics, Comparative Study, Digital Signal Processing, Multimedia ApplicationsAbstract
Introduction/Importance of Study:
This study introduces a comprehensive evaluation of audio compression algorithms to address the increasing demand for efficient data compression techniques in various audio processing applications.
Novelty statement:
Our research contributes novel insights into the comparative analysis of audio compression algorithms, offering a systematic approach to assess performance across multiple dimensions.
Material and Method:
The research methodology involved the selection of a diverse dataset comprising five audio files, rigorous implementation of four prominent compression algorithms, and systematic evaluation of performance metrics.
Results and Discussion:
The abstract primarily focuses on presenting the findings of the comparative analysis, highlighting the performance of MP3, LPC, Wavelet, and Sub band algorithms across various evaluation parameters.
Concluding Remarks:
In conclusion, our study identifies Wavelet compression as the optimal choice among the evaluated algorithms, offering exceptional accuracy, perceptual quality, and minimal distortion in audio compression.
References
T. Hidayat, M. H. Zakaria, and A. N. C. Pee, “A critical assessment of advanced coding standards for lossless audio compression,” Int. J. Simul. Syst. Sci. Technol., vol. 19, no. 5, pp. 31.1-31.10, Oct. 2018, doi: 10.5013/IJSSST.A.19.05.31.
A. P. Reddy and V. Vijayarajan, “Audio compression with multi-algorithm fusion and its impact in speech emotion recognition,” Int. J. Speech Technol., vol. 23, no. 2, pp. 277–285, Jun. 2020, doi: 10.1007/S10772-020-09689-9/METRICS.
E. W. Abood et al., “Provably secure and efficient audio compression based on compressive sensing,” Int. J. Electr. Comput. Eng., vol. 13, no. 1, pp. 335–346, Feb. 2023, doi: 10.11591/IJECE.V13I1.PP335-346.
M. Bosi and R. E. Goldberg, “Introduction to Digital Audio Coding and Standards,” Introd. to Digit. Audio Coding Stand., 2003, doi: 10.1007/978-1-4615-0327-9.
S. Shukla, M. Ahirwar, R. Gupta, S. Jain, and D. S. Rajput, “Audio Compression Algorithm using Discrete Cosine Transform (DCT) and Lempel-Ziv-Welch (LZW) Encoding Method,” Proc. Int. Conf. Mach. Learn. Big Data, Cloud Parallel Comput. Trends, Prespectives Prospect. Com. 2019, pp. 476–480, Feb. 2019, doi: 10.1109/COMITCON.2019.8862228.
Z. J. Ahmed, L. E. George, and R. A. Hadi, “Audio compression using transforms and high order entropy encoding,” Int. J. Electr. Comput. Eng., vol. 11, no. 4, pp. 3459–3469, Aug. 2021, doi: 10.11591/IJECE.V11I4.PP3459-3469.
A. O. Salau, I. Oluwafemi, K. F. Faleye, and S. Jain, “Audio Compression Using a Modified Discrete Cosine Transform with Temporal Auditory Masking,” 2019 Int. Conf. Signal Process. Commun. ICSC 2019, pp. 135–142, Mar. 2019, doi: 10.1109/ICSC45622.2019.8938213.
A. O. Timothy and G. A. Junior, “Embedding Text in Audio Steganography System using Advanced Encryption Standard, Text Compression and Spread Spectrum Techniques in Mp3 and Mp4 File Formats,” Int. J. Comput. Appl., vol. 177, no. 41, pp. 975–8887, 2020.
S. Prince, D. Bini, A. A. Kirubaraj, S. J. Immanuel, and M. Surya, “Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications,” Int. J. Electron. Telecommun., vol. 69, no. 2, pp. 287–292, 2023, doi: 10.24425/IJET.2023.144363.
J. McFarlane and B. R. Chakravarthi, “MP3 compression classification through audio analysis statistics.” Audio Engineering Society, May 02, 2022. Accessed: Mar. 03, 2024. [Online]. Available: http://www.aes.org/e-lib
B. Gold, N. Morgan, and D. Ellis, “Speech and Audio Signal Processing: Processing and Perception of Speech and Music: Second Edition,” Speech Audio Signal Process. Process. Percept. Speech Music Second Ed., Oct. 2011, doi: 10.1002/9781118142882.
“Discrete-Time Processing of Speech Signals | IEEE eBooks | IEEE Xplore.” Accessed: Mar. 03, 2024. [Online]. Available: https://ieeexplore.ieee.org/book/5266102
X. Liu, H. Tian, Y. Huang, and J. Lu, “A novel steganographic method for algebraic-code-excited-linear-prediction speech streams based on fractional pitch delay search,” Multimed. Tools Appl., vol. 78, no. 7, pp. 8447–8461, Apr. 2019, doi: 10.1007/S11042-018-6867-7/METRICS.
X. Jiang, X. Peng, H. Xue, Y. Zhang, and Y. Lu, “Latent-Domain Predictive Neural Speech Coding,” 2023, doi: 10.1109/TASLP.2023.3277693.
C. Chen, L. Zhang, and R. L. K. Tiong, “A new lossy compression algorithm for wireless sensor networks using Bayesian predictive coding,” Wirel. Networks, vol. 26, no. 8, pp. 5981–5995, Nov. 2020, doi: 10.1007/S11276-020-02425-W/METRICS.
S. Shukla, R. Gupta, D. S. Rajput, Y. Goswami, and V. Sharma, “A Comparative Analysis of Lossless Compression Algorithms on Uniformly Quantized Audio Signals,” Int. J. Image, Graph. Signal Process., vol. 14, no. 6, pp. 59–69, Dec. 2022, doi: 10.5815/IJIGSP.2022.06.05.
et al. Välimäki, Vesa, “Subband synthesis in audio compression,” IEEE Signal Process. Mag., vol. 35, no. 5, pp. 106–126, 2018.
T. P. Zieliński, “Audio Compression,” Textb. Telecommun. Eng., vol. Part F1370, pp. 405–437, 2021, doi: 10.1007/978-3-030-49256-4_15/COVER.
Z.-N. Li, M. S. Drew, and J. Liu, “Basic Audio Compression Techniques,” pp. 479–504, 2021, doi: 10.1007/978-3-030-62124-7_13.
“SIPI Image Database - Misc.” Accessed: Dec. 02, 2023. [Online]. Available: https://sipi.usc.edu/database/database.php?volume=misc
S. T. Abdulrazzaq, M. M. Siddeq, and M. A. Rodrigues, “A Novel Steganography Approach for Audio Files,” SN Comput. Sci., vol. 1, no. 2, pp. 1–13, 2020, doi: 10.1007/s42979-020-0080-2.
N. F. Soliman, M. I. Khalil, A. D. Algarni, S. Ismail, R. Marzouk, and W. El-Shafai, “Efficient HEVC steganography approach based on audio compression and encryption in QFFT domain for secure multimedia communication,” Multimed. Tools Appl., vol. 80, no. 3, pp. 4789–4823, Jan. 2021, doi: 10.1007/S11042-020-09881-8/METRICS.
H. Gamper, C. K. A. Reddy, R. Cutler, I. J. Tashev, and J. Gehrke, “Intrusive and non-intrusive perceptual speech quality assessment using a convolutional neural network,” IEEE Work. Appl. Signal Process. to Audio Acoust., vol. 2019-October, pp. 85–89, Oct. 2019, doi: 10.1109/WASPAA.2019.8937202.
M. Talbi and M. Salim Bouhlel, “New Speech Compression Technique based on Filter Bank Design and Psychoacoustic Model”, doi: 10.20855/ijav.2019.24.41455.
K. Kąkol, G. Korvel, and B. Kostek, “Improving Objective Speech Quality Indicators in Noise Conditions,” Stud. Comput. Intell., vol. 869, pp. 199–218, 2020, doi: 10.1007/978-3-030-39250-5_11/COVER.
R. Din and A. J. Qasim, “Steganography analysis techniques applied to audio and image files,” Bull. Electr. Eng. Informatics, vol. 8, no. 4, pp. 1297–1302, Dec. 2019, doi: 10.11591/EEI.V8I4.1626.
A. S. Abosinnee and Z. M. Hussain, “STATISTICAL VS. INFORMATION-THEORETIC SIGNAL PROPERTIES OVER FFT-OFDM,” J. Theor. Appl. Inf. Technol., vol. 97, p. 22, 2019, Accessed: Mar. 03, 2024. [Online]. Available: www.jatit.org
A. G. Ramirez-Aristizabal and C. Kello, “EEG2Mel: Reconstructing Sound from Brain Responses to Music,” Jul. 2022, Accessed: Mar. 03, 2024. [Online]. Available: https://arxiv.org/abs/2207.13845v1
L. Amaya and E. Inga, “Compressed Sensing Technique for the Localization of Harmonic Distortions in Electrical Power Systems,” Sensors 2022, Vol. 22, Page 6434, vol. 22, no. 17, p. 6434, Aug. 2022, doi: 10.3390/S22176434.
P. Burrascano, A. Terenzi, S. Cecchi, M. Ciuffetti, and S. Spinsante, “A Swept-Sine-Type Single Measurement to Estimate Intermodulation Distortion in a Dynamic Range of Audio Signal Amplitudes,” IEEE Trans. Instrum. Meas., vol. 70, 2021, doi: 10.1109/TIM.2021.3077983.
A. Alaei, S. M. Saghaeian Nejad, J. F. Gieras, D. Lee, and J. Ahn, “Reduction of high‐frequency injection losses, acoustic noise and total harmonic distortion in IPMSM sensorless drives,” IET Power Electron., vol. 12, no. 12, pp. 3197–3207, Oct. 2019, doi: 10.1049/IET-PEL.2018.6250.
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 50SEA
This work is licensed under a Creative Commons Attribution 4.0 International License.