Skip to content

litenup/audio_hero

Repository files navigation

AudioHero

This is a ruby wrapper for sox, the swiss army knife for audio. I've implemented sensible defaults for basic operations so converting mp3 to wav is as simple as:

AudioHero::Sox.new(mp3file).convert

Installation

Requirements: Obviously, make sure your system has sox installed already.

Add this line to your application's Gemfile:

gem 'audio_hero'

And then execute:

$ bundle

Or install it yourself as:

$ gem install audio_hero

Usage

###Convert

Return the converted audio as ruby temp file object

require "open-uri"

file = open("url/or/path/to/audio/file")

new_file = AudioHero::Sox.new(file).convert({output_options: "-c 1", output_format: "mp3", channel: "left"})
# Optional: Close the file after you are done for garbage collection.
new_file.close!

# AudioHero can also take a direct filepath string as input

file_path = open("url/or/path/to/audio/file").path # "/path/to/tempfile"
new_file = AudioHero::Sox.new(file_path).convert({output_options: "-c 1", output_format: "mp3", channel: "left"})

# With no options hash it will runs this command by default: `sox -t input.mp3 -c 1 -b 16 -r 16k out.wav`
AudioHero::Sox.new(file).convert

Options(hash):

  • channel (set "left", "right", or leave blank for stereo)
  • input_format (default to "mp3")
  • output_format (default to "wav")
  • output_options (default to "-c 1 -b 16 -r 16k", single channel, 16hz, 16bits)
  • gc (no default, set to "true" to auto close! input file)

###Remove Silence

file = AudioHero::Sox.new(file).remove_silence({input_format: "wav", gc: "true"})

Options(hash):

  • above_period_duration (default to 0.1 seconds)
  • above_period_threshold silence_level (default to 0.03%)
  • below_period_duration (default to 0.1 seconds)
  • below_period_threshold silence_level (default to 0.03%)
  • input_format (default to "mp3")
  • output_format (default to "wav")
  • gc (no default, set to "true" to auto close! input file)

###Split into multiple files by silence

file_array = AudioHero::Sox.new(file).split_by_silence({input_format: "wav", gc: "true"})
# file_array == ["path/to/out001.wav", "path/to/out002.wav"]

Options(hash):

  • above_period_duration (default to 0.5 seconds)
  • above_period_threshold silence_level (default to 0.05%)
  • below_period_duration (default to 1.0 seconds)
  • below_period_threshold silence_level (default to 0.02%)
  • input_format (default to "mp3")
  • output_format (default to "wav")
  • output_filename (base filename, default to "out". If using custom version of SOX, use "%,1c" for starttime and use "%1,1c-%1,1d" for starttime-endtime as filename)
  • gc (no default, set to "true" to auto close! input file)

###Concatenate multiple audio files into one

file_array = ["file/location/1.mp3","file/location/2.mp3","file/location/3.mp3"]
file = AudioHero::Sox.new(file_array).concat({output_format: "mp3")

Options(hash):

  • output_format (default to "wav")

###Trim Cuts portions out of the audio. Any number of positions may be given. Usage: file = AudioHero::Sox.new(file).trim({trim_positions: "=10 =20 =30 =40"}) See sox trim effect for more examples Default output to 16bit 16 sample rate Wave audio

file = AudioHero::Sox.new(file).trim({trim_positions: "=10 =20 =30 =40", gc: "true"})

Options(hash):

  • trim_positions ("=10 =20 =30 =40" means retrieving audio from 10s-20s and from 30s-40s)
  • input_format (default to "mp3")
  • output_format (default to "wav")
  • output_options (default to "-c 1 -b 16 -r 16k", single channel, 16hz, 16bits)
  • gc (no default, set to "true" to auto close! input file)

###Stats Get statistics report on audio file (support up to 2 channels).

stats = AudioHero::Sox.new(file).stats({input_format: "wav"})
# {"dc_offset"=>"0.000398", "min_level"=>"-0.299591", "max_level"=>"0.303711", "pk_lev_db"=>"-10.35", "rms_lev_db"=>"-23.08", "rms_pk_db"=>"-16.19", "rms_tr_db"=>"-96.84", "crest_factor"=>"4.33", "flat_factor"=>"0.00", "pk_count"=>"2", "bit-depth"=>"15/16", "num_samples"=>"233k", "length_s"=>"14.544", "scale_max"=>"1.000000", "window_s"=>"0.050"}

Options(hash):

  • input_format (default to "mp3")
  • gc (no default, set to "true" to auto close! input file)

###Extract Features Extract audio features using custom version of Yaafe script.

features = AudioHero::Sox.new(file).extract_features({sample_rate: "8000"})
# {"feature1"=>0.34,"feature2"=>2.25}

Options(hash):

  • sample_rate (default to "8000")
  • gc (no default, set to "true" to auto close! input file)

###Custom Command

Run any sox command with this method.

file = AudioHero::Sox.new(file).command({global: "-V3", input_option: "-t mp3", output_option: "-c 1", effect: "remix 0 1", gc: "true"})
# Command generated: sox -V3 -t mp3 input.mp3 -c 1 output.wav remix 0 1

Options(hash):

  • global (sox global options, inserted right after sox)
  • input_options (sox input options, inserted before input file)
  • output_options (sox output options, inserted before output file)
  • effect (sox effect, inserted after output file)
  • gc (no default, set to "true" to auto close! input file)

Development

After checking out the repo, run bin/setup to install dependencies. Then, run rake spec to run the tests. You can also run bin/console for an interactive prompt that will allow you to experiment.

To install this gem onto your local machine, run bundle exec rake install. To release a new version, update the version number in version.rb, and then run bundle exec rake release, which will create a git tag for the version, push git commits and tags, and push the .gem file to rubygems.org.

Contributing

Bug reports and pull requests are welcome on GitHub at https://github.com/litenup/audio_hero. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the Contributor Covenant code of conduct.

License

The gem is available as open source under the terms of the MIT License.

About

No description, website, or topics provided.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published