Skip to content

Actions: robusta-dev/holmesgpt

Evaluate LLM test cases

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
238 workflow runs
238 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update poetry precommit
Evaluate LLM test cases #88: Commit b53741b pushed by moshemorad
December 4, 2024 11:58 1m 27s fix_lock_file
December 4, 2024 11:58 1m 27s
Add precommit
Evaluate LLM test cases #87: Commit f886fe5 pushed by moshemorad
December 4, 2024 11:47 1m 59s fix_lock_file
December 4, 2024 11:47 1m 59s
holmes to use grep logs more by tweaking instruction, move to grep -E…
Evaluate LLM test cases #86: Commit 3d88e0d pushed by nherment
December 4, 2024 11:31 1m 10s add_tool_argocd
December 4, 2024 11:31 1m 10s
test: update correct answer for test_ask_holmes/01_how_many_pods
Evaluate LLM test cases #85: Commit 330c2e1 pushed by nherment
December 4, 2024 11:16 2m 5s llm_eval_further_improvements
December 4, 2024 11:16 2m 5s
holmes to use grep logs more by tweaking instruction, move to grep -E…
Evaluate LLM test cases #83: Commit 3d88e0d pushed by moshemorad
December 4, 2024 10:05 1m 3s 0.7.0
December 4, 2024 10:05 1m 3s
tests: move all llm evals to use correctness
Evaluate LLM test cases #82: Commit bde7b5e pushed by nherment
December 4, 2024 10:05 1m 9s llm_eval_further_improvements
December 4, 2024 10:05 1m 9s
Fixed usage of internet tools
Evaluate LLM test cases #81: Commit e3bce1c pushed by itisallgood
December 3, 2024 14:15 1m 5s main-2395-toolset-integrations
December 3, 2024 14:15 1m 5s
Fixed mock_tools llm tests
Evaluate LLM test cases #80: Commit 54af56c pushed by itisallgood
December 3, 2024 14:14 1m 18s main-2395-toolset-integrations
December 3, 2024 14:14 1m 18s
tests: improve llm evals
Evaluate LLM test cases #79: Commit f97e506 pushed by nherment
December 3, 2024 13:20 1m 15s llm_eval_further_improvements
December 3, 2024 13:20 1m 15s
Removed check_prerequisties from internet and finding
Evaluate LLM test cases #78: Commit fcc13fd pushed by itisallgood
December 3, 2024 12:38 1m 3s main-2395-toolset-integrations
December 3, 2024 12:38 1m 3s
tests: improve LLM evals, prefer correctness over faithfulness score
Evaluate LLM test cases #77: Commit fb4693c pushed by nherment
December 3, 2024 12:26 1m 27s llm_eval_further_improvements
December 3, 2024 12:26 1m 27s
Added comments and improvements for sync holmes
Evaluate LLM test cases #76: Commit 1e445ec pushed by itisallgood
December 3, 2024 08:35 1m 17s main-2395-toolset-integrations
December 3, 2024 08:35 1m 17s
Made docs_url non mandatory for toolsets
Evaluate LLM test cases #75: Commit 4fe31a2 pushed by itisallgood
December 2, 2024 15:39 1m 18s main-2395-toolset-integrations
December 2, 2024 15:39 1m 18s
Updated poetry.lock
Evaluate LLM test cases #74: Commit 2426d7e pushed by itisallgood
December 2, 2024 11:32 1m 6s main-2395-toolset-integrations
December 2, 2024 11:32 1m 6s
Merge remote-tracking branch 'origin/master' into llm_eval_improvements
Evaluate LLM test cases #71: Commit 4acaf7d pushed by nherment
December 2, 2024 07:11 1m 19s llm_eval_improvements
December 2, 2024 07:11 1m 19s
chore: unpin playwright dependency
Evaluate LLM test cases #70: Commit 726f8a9 pushed by nherment
December 2, 2024 06:43 1m 8s llm_eval_improvements
December 2, 2024 06:43 1m 8s
Merge branch 'master' into main-2395-toolset-integrations
Evaluate LLM test cases #69: Commit 5f42fed pushed by itisallgood
December 2, 2024 04:22 1m 8s main-2395-toolset-integrations
December 2, 2024 04:22 1m 8s
Refactored holmes_sync and create_toolcalling_llm
Evaluate LLM test cases #68: Commit 54c2b77 pushed by itisallgood
December 2, 2024 04:07 1m 8s main-2395-toolset-integrations
December 2, 2024 04:07 1m 8s
Refactored code related to ALLOWED_TOOLSETS
Evaluate LLM test cases #67: Commit 817db7d pushed by itisallgood
November 30, 2024 23:39 1m 6s main-2395-toolset-integrations
November 30, 2024 23:39 1m 6s
Updated kubernetes tools definitions
Evaluate LLM test cases #66: Commit e0f2a05 pushed by itisallgood
November 30, 2024 21:39 1m 7s main-2395-toolset-integrations
November 30, 2024 21:39 1m 7s
Added new values for DEFAULT and ALLOWED toolsets envs
Evaluate LLM test cases #65: Commit 5dcb379 pushed by itisallgood
November 30, 2024 21:36 59s main-2395-toolset-integrations
November 30, 2024 21:36 59s
Fixed usage of CUSTOM_TOOLSET_LOCATION variable
Evaluate LLM test cases #64: Commit 3b6a8c6 pushed by itisallgood
November 30, 2024 21:29 1m 14s main-2395-toolset-integrations
November 30, 2024 21:29 1m 14s