High-Resolution Audio Processing Toolkit
A professional-grade audio processing application that enhances and restores audio quality using advanced signal processing algorithms and psychoacoustic models.
Restores high-frequency content lost due to lossy compression or bandwidth limitation.
- HFP V2 (MDCT-based): Uses Modified Discrete Cosine Transform with harmonic structure analysis
- HFP V1 (STFT-based): Short-Time Fourier Transform based approach with HPSS separation
Psychoacoustically-informed resampler with adaptive mixed-phase filtering.
- Supports: 32kHz, 44.1kHz, 48kHz, 88.2kHz, 96kHz, 192kHz
- Rate-Distortion Optimization (RDO) for minimal perceptual loss
Advanced dithering with psychoacoustic noise shaping.
- Supports 16-bit, 24-bit PCM and 32-bit Float
- Adaptive LPC-based noise shaping filter
- Configurable LPC order (1-32)
Cross-platform GUI built with Flet framework.
- Dark theme with glassmorphism design
- Batch processing with file list management
- Real-time progress tracking
- Bilingual support (English/Japanese)
- Python 3.8 or higher
- pip package manager
pip install -r requirements.txtnumpy>= 1.21.0scipy>= 1.7.0librosa>= 0.9.0soundfile>= 0.10.0numba>= 0.55.0flet>= 0.20.0
python -m HRAudioWizardOr run the Flet UI directly:
python flet_ui.pyfrom HRAudioWizard import HighFrequencyRestorer, RDONoiseShaper, IntelligentRDOResampler
import librosa
import soundfile as sf
# Load audio
audio, sr = librosa.load("input.mp3", sr=None, mono=False)
audio = audio.T # (samples, channels)
# High-frequency restoration
restorer = HighFrequencyRestorer()
restored = restorer.run_hfpv2(audio, sr, lowpass=-1, enable_compressed_fix=True)
# Resample to 96kHz
resampler = IntelligentRDOResampler(sr, 96000)
resampled = resampler.resample(restored)
# Apply noise shaping for 24-bit output
shaper = RDONoiseShaper(24, 96000, lpc_order=16)
output = shaper.process(resampled)
# Save
sf.write("output.wav", output, 96000, subtype="PCM_24")HRAudioWizard/
├── __init__.py # Package initialization
├── __main__.py # Entry point
├── flet_ui.py # Cross-platform GUI (Flet)
├── hfr.py # HFPv2 high-frequency restoration
├── hfp_v1.py # HFP V1 (STFT-based) algorithm
├── mdct.py # MDCT/IMDCT implementation
├── griffin_lim.py # Griffin-Lim phase reconstruction
├── noise_shaper.py # RDO noise shaping/dithering
├── resampler.py # Intelligent RDO resampler
├── psychoacoustic.py # Psychoacoustic model
├── spectra_utils.py # Spectral utilities
├── numba_utils.py # JIT-compiled functions
├── localization.py # i18n support (EN/JP)
└── requirements.txt # Dependencies
- Mid/Side decomposition for stereo processing
- Transient detection using onset detection
- MDCT transformation and HPSS separation
- Harmonic structure analysis via cepstrum
- Overtone extrapolation based on correlation analysis
- Griffin-Lim phase estimation
- Spectral connection with smooth crossfade
- Psychoacoustic analysis using masking curves
- LPC coefficient calculation (Levinson-Durbin)
- Adaptive filter design based on masking threshold
- TPDF dithering with feedback error shaping
Input: WAV, FLAC, MP3, OGG, AIFF
Output: WAV (16-bit, 24-bit PCM / 32-bit Float)
MIT License
HRAudioWizard Team / SYH99999