Auto-Landmark

This is open source toolkit for speech landmark detection Here's the English version of the introduction:

Introduction

This toolkit provides an efficient and accurate method for extracting acoustic landmarks, assisting researchers and developers in identifying key acoustic events in speech signal processing. The definition of landmarks is based on the paper "Landmark detection for distinctive feature-based speech recognition", offering a framework for landmark detection grounded in distinctive speech features. Our toolkit builds upon the methodology proposed in the paper "Auto-Landmark: Acoustic Landmark Dataset and Open-Source Toolkit for Landmark Extraction", implementing an open-source solution that allows users to extract and analyze acoustic landmarks in their own projects seamlessly.

Here’s an additional section to include before the "Basic Use" section, based on the information in the table:

Basic Use

To extract acoustic landmarks from an audio file, you can use the following example. First, import the extract_all_landmarks function and pass the path of the audio file as an argument:

from methods.Basic.Landmarks_func import extract_all_landmarks

# Example of extracting landmarks from an audio file
landmarks = extract_all_landmarks("path/to/your/audiofile.wav")

In this example, replace "path/to/your/audiofile.wav" with the actual file path of the audio file you want to process. The function will return the detected landmarks, which can then be used for further analysis.

Here's a general description of the extracted landmarks:

The toolkit extracts various acoustic landmarks from an audio file, providing their occurrence times within the signal. Each landmark type is represented by a key, with an associated list of timestamps where these landmarks were detected. The keys correspond to specific types of acoustic events, such as:

'g+': Glottal Onsets
'g-': Glottal Offsets
's+': Syllabic Onsets
's-': Syllabic Offsets
'b+': Burst Onsets
'b-': Burst Offsets
'v+': Voiced Frication Onsets
'v-': Voiced Frication Offsets
'f+': Frication Onsets
'f-': Frication Offsets
'p+': Periodicity Onsets
'p-': Periodicity Offsets

Each list contains the times (in seconds) at which the corresponding landmarks were detected in the audio file. If no landmarks of a specific type are found, an empty list is returned.

Here's the "Recipe Use" section indicating that it is currently under development:

Recipe Use

We are actively developing detailed recipes for various use cases, including extracting landmarks from different datasets, customizing detection parameters, and integrating with other speech processing pipelines. These recipes will provide step-by-step instructions to help users tailor the toolkit for their specific needs. Stay tuned for updates and new features in upcoming releases!

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
egs		egs
methods		methods
utils		utils
visualisation		visualisation
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Auto-Landmark

Introduction

Basic Use

Recipe Use

About

Releases

Packages

Contributors 3

Languages

Tonyyouyou/Auto-Landmark

Folders and files

Latest commit

History

Repository files navigation

Auto-Landmark

Introduction

Basic Use

Recipe Use

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages