Upload the final mastered version of your chapter — the file you'd submit to ACX, Findaway, or your client. WAV, MP3, M4A, FLAC, and AIFF are all supported.
Non-WAV files are converted internally before scanning. Your original file is never modified.
NarScan needs a sample of what your room tone should sound like — a few seconds of quiet background from the same recording session, after mastering.
Pick the mode that matches your workflow. See the baseline guide below for help choosing.
Threshold (3.0×) — how much louder than baseline a section must be to get flagged. At 3.0×, a section needs to be 3 times louder than your normal room tone. Lower = more sensitive.
Min duration (500ms) — shortest anomaly worth reporting. Prevents brief breath sounds or transients from cluttering the results.
The timeline shows your file's noise floor at a glance. Grey bars are quiet non-speech sections. Red sections are anomalies. The indigo dashed line is your baseline.
The table lists each flagged section with its timestamp, duration, dB level, and severity. Take those timestamps into your DAW, listen, and fix what's real.
Borrows the last N seconds as its reference. Adjust the sample window slider to match how much room tone you typically leave at the end.
→ Best for: narrators who leave room tone or silence at the chapter tail
Borrows the first N seconds instead. Use this if you record a few seconds of silence before you start speaking, or if your tail has music or outro content.
→ Best for: narrators who record room tone at the top of the session
A dedicated file from the same session, after mastering. No guesswork — most accurate baseline possible, independent of file length or structure.
→ Best for: critical submissions, production studios, or when head/tail aren't reliable