chatllm.kt

kotlin api wrapper for llm-inference chatllm.cpp

All credits go to original repo: https://github.com/foldl/chatllm.cpp and Qwen 2.5 32b Coder Instruct which made 99% of work. I only guided it with prompts.

Generation of C interop bindings:

cinterop -def libchatllm.def -o C:\ktnt\bin\libchatllm_interop

Compilation of executable:

kotlinc-native main.kt -library C:\your_path_to_kotlin_native\bin\libchatllm_interop.klib -linker-options -L. -linker-options -llibchatllm -o main

Then launch executable like this: main.exe -m qwen2.5-1.5b.bin

Links for quantatized models:

QWen-2.5 1.5B - https://modelscope.cn/api/v1/models/judd2024/chatllm_quantized_qwen2.5/repo?Revision=master&FilePath=qwen2.5-1.5b.bin

Gemma-2 2B - https://modelscope.cn/api/v1/models/judd2024/chatllm_quantized_gemma2_2b/repo?Revision=master&FilePath=gemma2-2b.bin

If you need more quantatized models use this python model downloader: https://github.com/foldl/chatllm.cpp/blob/master/scripts/model_downloader.py

You can convert custom safetensors model to inner chatllm.cpp format by using this script: https://github.com/foldl/chatllm.cpp/blob/master/convert.py

Converting tutorial: https://github.com/foldl/chatllm.cpp?tab=readme-ov-file#quantize-model

List of supported llm architecture types suitable for conversion: https://github.com/foldl/chatllm.cpp/blob/master/docs/models.md

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Main.kt		Main.kt
README.md		README.md
ggml.dll		ggml.dll
libchatllm.def		libchatllm.def
libchatllm.dll		libchatllm.dll
libchatllm.h		libchatllm.h
libchatllm.lib		libchatllm.lib

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

chatllm.kt

About

Releases 1

Packages

Languages

JohnClaw/chatllm.kt

Folders and files

Latest commit

History

Repository files navigation

chatllm.kt

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages