Document current implementation in gherkin #14

bwilczek · 2024-04-05T17:48:43Z

Main use cases to be covered (from a pairing session with @id-ilych):

Populate knowledge base. Use CLI -> fetch from Confluence (pages with all the comments) -> SQL -> OpenAI generate embeddings -> vector db

Q&A. User posts a message to a Slack channel the bot is invited to. Slack listener receives the event -> detects if it is a question (if it has ? at the end) -> generates an embedding for the question (using OpenAI) -> retrieves the most relevant documents from the vector database -> builds a message that includes the content of the relevant pages + user’s question -> creates a conversation with OpenAI (thread in an assistant) -> waits for a response -> posts it to Slack as a response (in a thread) -> stores it in the database (including the association between assistant thread id and Slack thread id)

Feedback. User posts a message to a thread started from the original question -> same as Q&A, but instead of creating a new conversation it posts to an existing one.

Identify knowledge gaps. Use CLI -> initiate the process by selecting a topic -> it then uses embeddings and OpenAI to select unanswered questions about that topic -> unanswered questions are posted to a specific Slack channel -> users can answer them in threads -> bookmarked conversations are uploaded to Confluence (dedicated space).

Gamification (cross-cutting concern). Asking questions is rewarded with seeker points. When your question was unanswered, you get revealer points during knowledge gap discovery. Luminary points are given to those who answer knowledge gap questions.
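To make the difference between the Q&A and Feedback flows concrete, here is a minimal sketch of the routing decision. All names (`is_question`, `ThreadStore`, `route_message`) are hypothetical and not taken from the codebase. The Slack and OpenAI calls are left out, the store is an in-memory stand-in for the database table, and ignoring replies in threads the bot does not track is an assumption.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


def is_question(text: str) -> bool:
    """Question detection as described above: the message ends with '?'."""
    return text.rstrip().endswith("?")


@dataclass
class ThreadStore:
    """Hypothetical in-memory stand-in for the DB table that links
    Slack thread ids to assistant thread ids."""
    links: Dict[str, str] = field(default_factory=dict)

    def link(self, slack_thread_id: str, assistant_thread_id: str) -> None:
        self.links[slack_thread_id] = assistant_thread_id

    def find(self, slack_thread_id: str) -> Optional[str]:
        return self.links.get(slack_thread_id)


def route_message(text: str, slack_thread_id: Optional[str], store: ThreadStore) -> str:
    """Decide which flow a Slack message belongs to.

    'feedback' -> reply in a tracked thread: post to the existing conversation
    'qa'       -> new top-level question: create a new conversation
    'ignore'   -> anything else (assumption: untracked threads are skipped)
    """
    if slack_thread_id is not None:
        return "feedback" if store.find(slack_thread_id) is not None else "ignore"
    return "qa" if is_question(text) else "ignore"
```

In this sketch a top-level message carries no thread id, so only the `'qa'` branch would go on to create a new assistant thread and call `store.link(...)` once the answer is posted.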

id-ilych · 2024-04-15T08:35:29Z

features/knowledge_gaps.feature

+    And corresponding documents are not found in the database
+    And question "What power superman has?" is saved as unanswered


Documents are always found (at least with the current implementation, though maybe we could set a distance threshold). So it is up to OpenAI to recognize that the given documents don't have info about that.
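A rough sketch of what such a distance threshold could look like, in case we go that way. Nothing here exists in the current implementation: `MAX_DISTANCE`, `filter_relevant` and `has_knowledge_gap` are made-up names, the cut-off value is arbitrary, and the `(page_id, distance)` shape of the search hits is an assumption about what the vector db returns.

```python
from typing import List, Tuple

# Hypothetical cut-off; a sensible value depends on the embedding model
# and on the distance metric used by the vector db.
MAX_DISTANCE = 0.5

Hit = Tuple[str, float]  # assumed shape: (page_id, distance), smaller = closer


def filter_relevant(hits: List[Hit], max_distance: float = MAX_DISTANCE) -> List[str]:
    """Keep only page ids whose distance to the question is within the threshold."""
    return [page_id for page_id, distance in hits if distance <= max_distance]


def has_knowledge_gap(hits: List[Hit], max_distance: float = MAX_DISTANCE) -> bool:
    """True when no document is close enough to the question."""
    return not filter_relevant(hits, max_distance)
```

With something like this in place, the step "corresponding documents are not found in the database" would describe a real code path instead of relying on OpenAI to notice that the documents are irrelevant.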

id-ilych · 2024-04-15T08:36:22Z

features/knowledge_gaps.feature

+    Then listener asks OpenAI for embeddings for the given phrase
+    And corresponding documents are not found in the database
+    And question "What power superman has?" is saved as unanswered
+    And I get rewarded 1 revealer point


Revealer points are rewarded only when a knowledge gap search is initiated. At least with the current implementation.
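To spell out which event gives which kind of point, here is a small sketch of the rules as I understand them. The names (`Event`, `POINT_KIND`, `award`) are hypothetical, and one point per event is an assumption based on the "1 revealer point" step above.

```python
from collections import Counter
from enum import Enum


class Event(Enum):
    QUESTION_ASKED = "question_asked"
    # Raised by the knowledge gap search, not by the Slack listener.
    GAP_SEARCH_FOUND_UNANSWERED = "gap_search_found_unanswered"
    GAP_QUESTION_ANSWERED = "gap_question_answered"


# Which kind of point each event is rewarded with.
POINT_KIND = {
    Event.QUESTION_ASKED: "seeker",
    Event.GAP_SEARCH_FOUND_UNANSWERED: "revealer",
    Event.GAP_QUESTION_ANSWERED: "luminary",
}


def award(scores: Counter, user: str, event: Event, amount: int = 1) -> None:
    """Add points of the kind matching the event to the user's score."""
    scores[(user, POINT_KIND[event])] += amount
```

So asking a question that nobody can answer would only give a seeker point at that moment; the revealer point would come later, when the gap search picks that question up.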

id-ilych · 2024-04-15T08:38:14Z

features/populate_knowledge_base.feature

+      | Holiday Policy |
+    And binary document representations are stored in vector database
+      | title           | embeddings      |
+      | Holiday Policy  | <format t.b.d.> |


Just in case - vector db stores page ids, not titles. Though I think we could build a matcher that will transform it automatically.
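Something along these lines could do the transformation, as a sketch only. `titles_to_page_ids` is a made-up helper, the rows are assumed to arrive as a list of dicts keyed by the table header, and `pages_by_title` is assumed to be built from the pages loaded earlier in the scenario.

```python
from typing import Dict, List


def titles_to_page_ids(table_rows: List[Dict[str, str]],
                       pages_by_title: Dict[str, str]) -> List[str]:
    """Translate the 'title' column of a Gherkin table into page ids.

    Raises KeyError listing every title that has no known page id, so a
    typo in the feature file fails loudly instead of silently matching nothing.
    """
    titles = [row["title"].strip() for row in table_rows]
    missing = [title for title in titles if title not in pages_by_title]
    if missing:
        raise KeyError(f"No page id known for titles: {missing}")
    return [pages_by_title[title] for title in titles]
```

The step definition could then compare the returned ids with what is actually stored in the vector db, while the feature file keeps the readable titles.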

bwilczek self-assigned this Apr 5, 2024

bwilczek force-pushed the devx-3878-document-current-implementation-in-gherkin branch 2 times, most recently from b6c1c17 to 45eba5b Compare April 11, 2024 06:46

Document current implementation in gherkin

2db00e3

bwilczek force-pushed the devx-3878-document-current-implementation-in-gherkin branch from 39093d1 to 2db00e3 Compare April 15, 2024 07:41

bwilczek marked this pull request as ready for review April 15, 2024 07:41

id-ilych approved these changes Apr 15, 2024

bwilczek closed this Apr 16, 2024

bwilczek deleted the devx-3878-document-current-implementation-in-gherkin branch April 16, 2024 08:16
