Fine-Tuning Audio Compression: Algorithmic Implementation and Performance Metrics

Authors

  • Umer Ijaz Department of Electrical Engineering & Technology, GC University, Faisalabad
  • Fouzia Gillani Department of Mechanical Engineering and Technology, GC University, Faisalabad
  • Ali Iqbal Department of Electrical Engineering & Technology, GC University, Faisalabad
  • Muhammad Saad Sharif Department of Electrical Engineering & Technology, GC University, Faisalabad
  • Muhammad Fraz Anwar Department of Electrical Engineering & Technology, GC University, Faisalabad
  • Abubaker Ijaz WASA, Faisalabad

Keywords:

Audio Compression, Algorithm Evaluation, MP3 Compression, LPC Compression, Wavelet Compression, Subband Compression, Performance Metrics, Comparative Study, Digital Signal Processing, Multimedia Applications

Abstract

Introduction/Importance of Study:

This study introduces a comprehensive evaluation of audio compression algorithms to address the increasing demand for efficient data compression techniques in various audio processing applications.

Novelty statement:

Our research contributes novel insights into the comparative analysis of audio compression algorithms, offering a systematic approach to assess performance across multiple dimensions.

Material and Method:

The research methodology involved the selection of a diverse dataset comprising five audio files, rigorous implementation of four prominent compression algorithms, and systematic evaluation of performance metrics.

Results and Discussion:

The abstract primarily focuses on presenting the findings of the comparative analysis, highlighting the performance of MP3, LPC, Wavelet, and Sub band algorithms across various evaluation parameters.

Concluding Remarks:

In conclusion, our study identifies Wavelet compression as the optimal choice among the evaluated algorithms, offering exceptional accuracy, perceptual quality, and minimal distortion in audio compression.

Author Biographies

Umer Ijaz, Department of Electrical Engineering & Technology, GC University, Faisalabad

Assistant Professor

Department of Electrical Engineering & Technology, GC University, Faisalabad

Fouzia Gillani, Department of Mechanical Engineering and Technology, GC University, Faisalabad

Assistant Professor

Department of Mechanical Engineering and Technology, GC University, Faisalabad

Ali Iqbal, Department of Electrical Engineering & Technology, GC University, Faisalabad

Lecturer

Department of Electrical Engineering & Technology, GC University, Faisalabad

Muhammad Saad Sharif, Department of Electrical Engineering & Technology, GC University, Faisalabad

Lecturer

Department of Electrical Engineering & Technology, GC University, Faisalabad

Muhammad Fraz Anwar, Department of Electrical Engineering & Technology, GC University, Faisalabad

Teaching Assistant

Department of Electrical Engineering & Technology, GC University, Faisalabad

Abubaker Ijaz, WASA, Faisalabad

Director Development

WASA Faisalabad

References

T. Hidayat, M. H. Zakaria, and A. N. C. Pee, “A critical assessment of advanced coding standards for lossless audio compression,” Int. J. Simul. Syst. Sci. Technol., vol. 19, no. 5, pp. 31.1-31.10, Oct. 2018, doi: 10.5013/IJSSST.A.19.05.31.

A. P. Reddy and V. Vijayarajan, “Audio compression with multi-algorithm fusion and its impact in speech emotion recognition,” Int. J. Speech Technol., vol. 23, no. 2, pp. 277–285, Jun. 2020, doi: 10.1007/S10772-020-09689-9/METRICS.

E. W. Abood et al., “Provably secure and efficient audio compression based on compressive sensing,” Int. J. Electr. Comput. Eng., vol. 13, no. 1, pp. 335–346, Feb. 2023, doi: 10.11591/IJECE.V13I1.PP335-346.

M. Bosi and R. E. Goldberg, “Introduction to Digital Audio Coding and Standards,” Introd. to Digit. Audio Coding Stand., 2003, doi: 10.1007/978-1-4615-0327-9.

S. Shukla, M. Ahirwar, R. Gupta, S. Jain, and D. S. Rajput, “Audio Compression Algorithm using Discrete Cosine Transform (DCT) and Lempel-Ziv-Welch (LZW) Encoding Method,” Proc. Int. Conf. Mach. Learn. Big Data, Cloud Parallel Comput. Trends, Prespectives Prospect. Com. 2019, pp. 476–480, Feb. 2019, doi: 10.1109/COMITCON.2019.8862228.

Z. J. Ahmed, L. E. George, and R. A. Hadi, “Audio compression using transforms and high order entropy encoding,” Int. J. Electr. Comput. Eng., vol. 11, no. 4, pp. 3459–3469, Aug. 2021, doi: 10.11591/IJECE.V11I4.PP3459-3469.

A. O. Salau, I. Oluwafemi, K. F. Faleye, and S. Jain, “Audio Compression Using a Modified Discrete Cosine Transform with Temporal Auditory Masking,” 2019 Int. Conf. Signal Process. Commun. ICSC 2019, pp. 135–142, Mar. 2019, doi: 10.1109/ICSC45622.2019.8938213.

A. O. Timothy and G. A. Junior, “Embedding Text in Audio Steganography System using Advanced Encryption Standard, Text Compression and Spread Spectrum Techniques in Mp3 and Mp4 File Formats,” Int. J. Comput. Appl., vol. 177, no. 41, pp. 975–8887, 2020.

S. Prince, D. Bini, A. A. Kirubaraj, S. J. Immanuel, and M. Surya, “Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications,” Int. J. Electron. Telecommun., vol. 69, no. 2, pp. 287–292, 2023, doi: 10.24425/IJET.2023.144363.

J. McFarlane and B. R. Chakravarthi, “MP3 compression classification through audio analysis statistics.” Audio Engineering Society, May 02, 2022. Accessed: Mar. 03, 2024. [Online]. Available: http://www.aes.org/e-lib

B. Gold, N. Morgan, and D. Ellis, “Speech and Audio Signal Processing: Processing and Perception of Speech and Music: Second Edition,” Speech Audio Signal Process. Process. Percept. Speech Music Second Ed., Oct. 2011, doi: 10.1002/9781118142882.

“Discrete-Time Processing of Speech Signals | IEEE eBooks | IEEE Xplore.” Accessed: Mar. 03, 2024. [Online]. Available: https://ieeexplore.ieee.org/book/5266102

X. Liu, H. Tian, Y. Huang, and J. Lu, “A novel steganographic method for algebraic-code-excited-linear-prediction speech streams based on fractional pitch delay search,” Multimed. Tools Appl., vol. 78, no. 7, pp. 8447–8461, Apr. 2019, doi: 10.1007/S11042-018-6867-7/METRICS.

X. Jiang, X. Peng, H. Xue, Y. Zhang, and Y. Lu, “Latent-Domain Predictive Neural Speech Coding,” 2023, doi: 10.1109/TASLP.2023.3277693.

C. Chen, L. Zhang, and R. L. K. Tiong, “A new lossy compression algorithm for wireless sensor networks using Bayesian predictive coding,” Wirel. Networks, vol. 26, no. 8, pp. 5981–5995, Nov. 2020, doi: 10.1007/S11276-020-02425-W/METRICS.

S. Shukla, R. Gupta, D. S. Rajput, Y. Goswami, and V. Sharma, “A Comparative Analysis of Lossless Compression Algorithms on Uniformly Quantized Audio Signals,” Int. J. Image, Graph. Signal Process., vol. 14, no. 6, pp. 59–69, Dec. 2022, doi: 10.5815/IJIGSP.2022.06.05.

et al. Välimäki, Vesa, “Subband synthesis in audio compression,” IEEE Signal Process. Mag., vol. 35, no. 5, pp. 106–126, 2018.

T. P. Zieliński, “Audio Compression,” Textb. Telecommun. Eng., vol. Part F1370, pp. 405–437, 2021, doi: 10.1007/978-3-030-49256-4_15/COVER.

Z.-N. Li, M. S. Drew, and J. Liu, “Basic Audio Compression Techniques,” pp. 479–504, 2021, doi: 10.1007/978-3-030-62124-7_13.

“SIPI Image Database - Misc.” Accessed: Dec. 02, 2023. [Online]. Available: https://sipi.usc.edu/database/database.php?volume=misc

S. T. Abdulrazzaq, M. M. Siddeq, and M. A. Rodrigues, “A Novel Steganography Approach for Audio Files,” SN Comput. Sci., vol. 1, no. 2, pp. 1–13, 2020, doi: 10.1007/s42979-020-0080-2.

N. F. Soliman, M. I. Khalil, A. D. Algarni, S. Ismail, R. Marzouk, and W. El-Shafai, “Efficient HEVC steganography approach based on audio compression and encryption in QFFT domain for secure multimedia communication,” Multimed. Tools Appl., vol. 80, no. 3, pp. 4789–4823, Jan. 2021, doi: 10.1007/S11042-020-09881-8/METRICS.

H. Gamper, C. K. A. Reddy, R. Cutler, I. J. Tashev, and J. Gehrke, “Intrusive and non-intrusive perceptual speech quality assessment using a convolutional neural network,” IEEE Work. Appl. Signal Process. to Audio Acoust., vol. 2019-October, pp. 85–89, Oct. 2019, doi: 10.1109/WASPAA.2019.8937202.

M. Talbi and M. Salim Bouhlel, “New Speech Compression Technique based on Filter Bank Design and Psychoacoustic Model”, doi: 10.20855/ijav.2019.24.41455.

K. Kąkol, G. Korvel, and B. Kostek, “Improving Objective Speech Quality Indicators in Noise Conditions,” Stud. Comput. Intell., vol. 869, pp. 199–218, 2020, doi: 10.1007/978-3-030-39250-5_11/COVER.

R. Din and A. J. Qasim, “Steganography analysis techniques applied to audio and image files,” Bull. Electr. Eng. Informatics, vol. 8, no. 4, pp. 1297–1302, Dec. 2019, doi: 10.11591/EEI.V8I4.1626.

A. S. Abosinnee and Z. M. Hussain, “STATISTICAL VS. INFORMATION-THEORETIC SIGNAL PROPERTIES OVER FFT-OFDM,” J. Theor. Appl. Inf. Technol., vol. 97, p. 22, 2019, Accessed: Mar. 03, 2024. [Online]. Available: www.jatit.org

A. G. Ramirez-Aristizabal and C. Kello, “EEG2Mel: Reconstructing Sound from Brain Responses to Music,” Jul. 2022, Accessed: Mar. 03, 2024. [Online]. Available: https://arxiv.org/abs/2207.13845v1

L. Amaya and E. Inga, “Compressed Sensing Technique for the Localization of Harmonic Distortions in Electrical Power Systems,” Sensors 2022, Vol. 22, Page 6434, vol. 22, no. 17, p. 6434, Aug. 2022, doi: 10.3390/S22176434.

P. Burrascano, A. Terenzi, S. Cecchi, M. Ciuffetti, and S. Spinsante, “A Swept-Sine-Type Single Measurement to Estimate Intermodulation Distortion in a Dynamic Range of Audio Signal Amplitudes,” IEEE Trans. Instrum. Meas., vol. 70, 2021, doi: 10.1109/TIM.2021.3077983.

A. Alaei, S. M. Saghaeian Nejad, J. F. Gieras, D. Lee, and J. Ahn, “Reduction of high‐frequency injection losses, acoustic noise and total harmonic distortion in IPMSM sensorless drives,” IET Power Electron., vol. 12, no. 12, pp. 3197–3207, Oct. 2019, doi: 10.1049/IET-PEL.2018.6250.

Downloads

Published

2024-03-05

How to Cite

Ijaz, U., Gillani, F., Iqbal, A., Sharif, M. S., Anwar, M. F., & Ijaz, A. (2024). Fine-Tuning Audio Compression: Algorithmic Implementation and Performance Metrics. International Journal of Innovations in Science & Technology, 6(1), 220–236. Retrieved from https://journal.50sea.com/index.php/IJIST/article/view/714