All the code and documentation are entirely generated by OpenAI's GPT-4 model.
OpenAI Translator is an AI-powered translation tool designed to translate English PDF books to Chinese. The tool leverages large language models (LLMs) like ChatGLM and OpenAI's GPT-3 and GPT-3.5 Turbo for translation. It's built in Python and has a flexible, modular, and object-oriented design.
In the current landscape, there's a lack of non-commercial yet efficient PDF translation tools. Many users have PDF documents with sensitive data that they prefer not to upload to public commercial service websites due to privacy concerns. This project was developed to address this gap, providing a solution for users who need to translate their PDFs while maintaining data privacy.
OpenAI Translator is still in the early stages of development, and we are actively working on adding features and improving its performance. We appreciate any feedback or contributions!
"The Old Man and the Sea"
- Translation of English PDF books to Chinese using LLMs.
- Support for both ChatGLM and OpenAI models.
- Flexible configuration through a YAML file or command-line arguments.
- Timeouts and error handling for robust translation operations.
- Modular and object-oriented design for easy customization and extension.
Planned improvements:
- Add support for other languages and translation directions.
- Implement a graphical user interface (GUI) for easier use.
- Implement stylized translation.
- Implement translation of only the first x pages of a document.
- Create a web service or API to enable usage in web applications.
- Add support for batch processing of multiple PDF files.
- Add support for preserving the original layout and formatting of the source PDF.
- Improve translation quality by using custom-trained translation models.
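The "timeouts and error handling" feature listed above can be illustrated with a small retry helper. This is a minimal sketch, not the project's actual implementation: the function name `call_with_retries`, its parameters, and the exponential backoff policy are all assumptions made for illustration, and `request` stands in for whatever call sends a chunk of text to the model.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retries(
    request: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `request`, retrying on TimeoutError with exponential backoff.

    Hypothetical helper: the retry policy is an assumption of this sketch,
    not something the project documents.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except TimeoutError:
            if attempt == max_attempts:
                # Out of attempts: let the caller see the timeout.
                raise
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError("unreachable")
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays; in normal use the default `time.sleep` applies.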
1. Clone the repository.
2. OpenAI-Translator requires Python 3.10 or later. Install the dependencies with `pip install -r requirements.txt`.
3. Set up your OpenAI API key (`$OPENAI_API_KEY`). You can either add it to your environment variables or specify it in the `config.yaml` file.
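The two ways of supplying the API key can be sketched as a single lookup. This is only an illustration under stated assumptions: the helper name `resolve_api_key`, the config key `openai_api_key`, and the choice to check the environment before the config file are all hypothetical and may differ from what the tool actually does.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    config: Mapping[str, str],
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Return the OpenAI API key from the environment or the loaded config.

    `config` is the already-parsed content of config.yaml. The key name
    "openai_api_key" and the environment-first precedence are assumptions
    of this sketch.
    """
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY") or config.get("openai_api_key")
    if not key:
        raise ValueError(
            "No OpenAI API key found: set OPENAI_API_KEY or add it to config.yaml"
        )
    return key
```

Failing early with a clear message avoids starting a long translation run that would only fail at the first API call.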
You can use OpenAI-Translator either by specifying a configuration file or by providing command-line arguments.
Adapt the `config.yaml` file to your settings:
model_name: "gpt-3.5-turbo"
input_file: "tests/test.pdf"
output_file_format: "markdown"
source_language: "English"
target_language: "Chinese"
Then run the tool:
python ai_translator/main.py
You can also specify the settings directly on the command line. Here's an example of how to use the OpenAI model:
# Set your api_key as an env variable
export OPENAI_API_KEY="sk-xxx"
python ai_translator/main.py --model_name "gpt-3.5-turbo" --input_file "your_input.pdf" --output_file_format "markdown" --source_language "English" --target_language "Chinese"
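One way the configuration file and the command-line arguments could be combined is shown below. It is a hedged sketch rather than the tool's real argument handling: the flag names come from the command above, but the helper names (`build_parser`, `merge_settings`), the in-memory `FILE_CONFIG` dictionary standing in for a parsed `config.yaml`, and the rule that command-line values override file values are assumptions.

```python
import argparse
from typing import Any, Dict, List

# Stand-in for the parsed config.yaml shown earlier (values copied from that example).
FILE_CONFIG: Dict[str, Any] = {
    "model_name": "gpt-3.5-turbo",
    "input_file": "tests/test.pdf",
    "output_file_format": "markdown",
    "source_language": "English",
    "target_language": "Chinese",
}


def build_parser() -> argparse.ArgumentParser:
    """Create a parser with one optional flag per known setting."""
    parser = argparse.ArgumentParser(description="Translate a PDF with an LLM.")
    for name in FILE_CONFIG:
        # default=None lets us tell "flag not given" apart from a real value.
        parser.add_argument(f"--{name}", default=None)
    return parser


def merge_settings(file_config: Dict[str, Any], argv: List[str]) -> Dict[str, Any]:
    """Overlay command-line values on top of the file configuration."""
    cli_values = vars(build_parser().parse_args(argv))
    merged = dict(file_config)
    merged.update({k: v for k, v in cli_values.items() if v is not None})
    return merged
```

For example, `merge_settings(FILE_CONFIG, ["--input_file", "your_input.pdf"])` keeps every value from the file except `input_file`, which takes the command-line value.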
This project is licensed under the GPL-3.0 License. See the LICENSE file for details.