📚 [Contribution] ebook2audiobook roadmap #32

@DrewThomasson

All Features open to public Contributions ⭐

  • Preview blocks/chapters before starting the conversion
  • Edit the converted output sentence by sentence for surgical text and SML tag changes
  • Change voice per chapter or sentence with SML tags
  • -h -help parameter info in different languages
  • OCR scanning for PDF / JPG / BMP / PNG / TIFF
  • Notebooks folder (talked about here)
  • Make Chinese text splitting avoid breaking words, and improve pause timing (talked about here)
  • Get the Kaggle notebook working
  • Get a working Google Colab notebook (talked about here)
  • Make an iOS app
  • Make an Android app
  • Audiobookshelf integration
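
For the Chinese text splitting item above, here is a minimal sketch of one possible approach, not the project's current implementation. It assumes that cutting only right after punctuation is enough to avoid breaking words, and it does no real word segmentation. The function name `split_chinese`, the `SPLIT_AFTER` set, and the `max_chars` parameter are all hypothetical.

```python
import re

# Assumption: these punctuation marks are treated as safe split points.
SPLIT_AFTER = '。！？；，'

def split_chinese(text:str, max_chars:int=20)->list[str]:
    # Split right after each punctuation mark, keeping the mark with its clause.
    pieces = [p for p in re.split(f'(?<=[{SPLIT_AFTER}])', text) if p]
    chunks:list[str] = []
    current = ''
    for piece in pieces:
        # Start a new chunk when adding this clause would exceed the limit.
        if current and len(current) + len(piece) > max_chars:
            chunks.append(current)
            current = piece
        else:
            current += piece
    if current:
        chunks.append(current)
    return chunks

print(split_chinese('你好，世界。今天天气很好！', max_chars=6))
```

A clause longer than `max_chars` is kept whole rather than cut, so the limit is a soft one. Pause timing is not handled here.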

Wanted Extra Parameters

  • Ebook Translation option
  • Output format choice
  • Batch ebook folder
  • Multiprocessing conversion
  • Make ebook input parameter accept a folder containing ebook files to auto-run through.
  • GPU device detection, then install the matching torch/torchaudio package
  • Denoise any reference audio uploaded for voice cloning
  • Custom model dir input pointing to a folder that contains all of the custom model files, if available, instead of having to point to each model file individually
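
As a rough illustration of the folder-input parameter wanted above, the sketch below accepts either a single file or a folder and returns the ebook files to run through. The function name `collect_ebooks` and the `EBOOK_EXTENSIONS` set are hypothetical and do not reflect the project's actual supported formats or CLI.

```python
from pathlib import Path

# Assumption: illustrative extension list, not the project's real supported set.
EBOOK_EXTENSIONS = {'.epub', '.pdf', '.mobi', '.txt'}

def collect_ebooks(source:str)->list[Path]:
    path = Path(source)
    # A single file is passed through unchanged.
    if path.is_file():
        return [path]
    # A folder is scanned (non-recursively) for files with a known extension.
    if path.is_dir():
        return sorted(p for p in path.iterdir() if p.is_file() and p.suffix.lower() in EBOOK_EXTENSIONS)
    raise FileNotFoundError(f'No such file or folder: {source}')
```

Sorting keeps the batch order stable between runs, which makes an interrupted batch easier to resume.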

TTS engines integration

Create Readme in these languages

  • Arabic (ara)
  • Chinese (zho)
  • English (eng)
  • Spanish (spa)
  • French (fra)
  • German (deu)
  • Italian (ita)
  • Portuguese (por)
  • Polish (pol)
  • Turkish (tur)
  • Russian (rus)
  • Dutch (nld)
  • Czech (ces)
  • Japanese (jpn)
  • Hindi (hin)
  • Bengali (ben)
  • Hungarian (hun)
  • Korean (kor)
  • Vietnamese (vie)
  • Swedish (swe)
  • Persian (fas)
  • Yoruba (yor)
  • Swahili (swa)
  • Indonesian (ind)
  • Slovak (slk)
  • Croatian (hrv)

🐍 Compatibility

  • 🍎 Mac Intel x86
  • 🪟 Windows x86
  • 🐧 Linux x86
  • 🖥️🍏 Apple Silicon Mac
  • 🪟💪 ARM Windows
  • 🐧💪 ARM Linux

Extra Overkill for training models and such (All supported Coqui-tts models and piper-tts in one easy command)

Auto-testing scripts for development

Python Code normalization information for contributors

  • No blank lines within code, except between functions and classes.
  • Single quotes for all keys, except in dict() and JSON; dict['key'] lookups always use single quotes.
  • 4-space indentation; no tabs at all.
  • Strict typing for every function: its argument declarations and its return value.
  • No space between an argument and its type annotation, and no space between the function signature, the "->" and the return type.

Example:

import json
from typing import Optional

def get_user(user_id:int, users:list[dict])->Optional[dict]:
    for user in users:
        if user['id'] == user_id:
            return user
    return None

def summarize(user:dict)->str:
    return f"User {user['name']} is {'active' if user['is_active'] else 'inactive'}."

def to_json(user:dict)->str:
    return json.dumps({"id": user['id'], "name": user['name'], "email": user['email']})

users:list = [
    dict(id=1, name='alice', email='[email protected]', role='admin', is_active=True),
    dict(id=2, name='bob', email='[email protected]', role='editor', is_active=False),
    dict(id=3, name='carol', email='[email protected]', role='viewer', is_active=True),
]
config = {
    "max_users": 100,
    "default_role": "viewer",
    "allow_signup": True,
}
roles = ['admin', 'editor', 'viewer']
found = get_user(1, users)
if found:
    print(summarize(found))
    print(found['email'])
    print(to_json(found))
if config['default_role'] in roles:
    print(config['default_role'])

Hardware donations wanted for beta tests

We accept any kind of hardware for testing our development builds, such as:

  • NVIDIA cards supporting CUDA >= 11.8
  • Intel XPU cards
  • AMD cards supporting ROCm >= 5.7
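
The version floors above could be checked with a plain tuple comparison. This is a sketch only: `MINIMUM_VERSIONS`, `parse_version` and `meets_minimum` are hypothetical names, and how the installed version string is obtained is left out.

```python
# Assumption: minimums copied from the list above; the names are illustrative.
MINIMUM_VERSIONS = dict(cuda='11.8', rocm='5.7')

def parse_version(version:str)->tuple[int, ...]:
    # '11.8' -> (11, 8), so that 5.10 correctly compares as newer than 5.7.
    return tuple(int(part) for part in version.split('.'))

def meets_minimum(backend:str, installed:str)->bool:
    return parse_version(installed) >= parse_version(MINIMUM_VERSIONS[backend])
```

Comparing integer tuples instead of raw strings avoids '5.10' sorting before '5.7'.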

@DrewThomasson if you want to help out at all! 😃

Metadata

Labels

  • Docs: Improvements or additions to documentation
  • Help Needed: Extra attention is needed
