waverdeep/CleanScript

CleanScript

A project that preprocesses the KsponSpeech open data provided by AI Hub.

Output format

  • Training dataset format of the Nvidia Jasper model
  • Training dataset format of the ClovaCall model

Libraries

  • csv
  • json
  • os
  • natsort

Functions

fileOI.get_divided_script

For datasets in which every audio file is paired with its own script, such as the KsponSpeech dataset provided by AI Hub, this function gathers the file paths of the text scripts only into a list. The paths are sorted in ascending order with the natsort library.

  • input_dir : path of the top-level directory in which to search for files
  • file_extension : extension of the files to include in the list (default : txt)
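The behaviour described above could be sketched roughly as follows. This is not the repository's actual code: the recursive `os.walk` traversal and the fallback sort used when natsort is not installed are assumptions made for illustration. Only the function name, the two parameters, and the use of natsort come from the README.

```python
import os
import re

try:
    from natsort import natsorted  # third-party library listed in the README
except ImportError:
    # Assumed stand-in, used only when natsort is not installed:
    # compare digit runs as integers and everything else as text.
    def natsorted(items):
        def key(s):
            return [int(t) if t.isdigit() else t for t in re.split(r"(\d+)", s)]
        return sorted(items, key=key)


def get_divided_script(input_dir, file_extension="txt"):
    """Collect paths of files under input_dir with the given extension, naturally sorted."""
    suffix = "." + file_extension.lstrip(".")
    paths = []
    # Assumption: subdirectories are searched recursively.
    for root, _dirs, files in os.walk(input_dir):
        for name in files:
            if name.endswith(suffix):
                paths.append(os.path.join(root, name))
    return natsorted(paths)
```

Natural sorting matters here because a plain string sort would place a file numbered 10 before a file numbered 2.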

script_preprocess.merge_script_like_clova_call

A function that builds the dataset in the format expected for training the ClovaCall model, an open-source project published on GitHub by clovaai.

ClovaCall.json

[
  {
    "wav" : "42_0603_748_0_03319_00.wav",
    "text" : "단체 할인이 가능한 시간대가 따로 있나요?",
    "speaker_id" : "03319"
  },
  ...,
  {
    "wav" : "42_0610_778_0_03607_01.wav",
    "text" : "애기들이 놀만한 놀이방이 따로 있나요?",
    "speaker_id" : "03607"
  }
]  
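As a rough illustration of how a file in this shape might be produced, the sketch below serializes a list of records with the standard `json` module. The helper names `to_clova_call_entries` and `dump_clova_call_json` and the tuple-based input are hypothetical. The README does not show the signature of `merge_script_like_clova_call` or how it derives `speaker_id`, so those details are left to the caller here.

```python
import json


def to_clova_call_entries(records):
    """Turn (wav, text, speaker_id) tuples into ClovaCall-style dictionaries."""
    return [
        {"wav": wav, "text": text, "speaker_id": speaker_id}
        for wav, text, speaker_id in records
    ]


def dump_clova_call_json(records):
    """Serialize the records as a JSON array of wav/text/speaker_id objects."""
    # ensure_ascii=False keeps Korean transcripts readable instead of \uXXXX escapes.
    return json.dumps(to_clova_call_entries(records), ensure_ascii=False, indent=2)
```

For example, `dump_clova_call_json([("42_0603_748_0_03319_00.wav", "단체 할인이 가능한 시간대가 따로 있나요?", "03319")])` returns a JSON array containing one object with the three keys shown in ClovaCall.json.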
