Skip to content
This repository has been archived by the owner on Apr 29, 2024. It is now read-only.

Document current implementation in gherkin #14

Closed

Conversation

bwilczek
Copy link

@bwilczek bwilczek commented Apr 5, 2024

Main use cases to be covered (from pairing session with with @id-ilych)

  1. Populate knowledge base. Use CLI -> fetch from Confluence (pages with all the comments) -> SQL -> OpenAI generate embeddings -> vector db
  2. Q&A. User posts a message to slack channel the bot is invited to. Slack listener receives the event -> detects if it is a question (if it has ? at the end) -> generates embedding for the question (using OpenAI) -> retrieves most relevant documents from vector database -> builds a message that includes content of relevant pages + user’s question -> creates a conversation with Open AI (thread in an assistant) -> wait for a response -> post to Slack as a response (in a thread) -> store it in database (including association between assistant thread id and slack thread id)
  3. Feedback. User posts a message to a thread started from the original question -> same as Q&A but instead of creating new conversation it posts to an exiting one.
  4. Identify knowledge gaps. Use CLI -> initiate the process by selecting a topic -> it then uses embeddings and OpenAI to select unanswered questions about that topic -> unanswered questions are posted to specific slack channel -> users can answer them in threads -> bookmarked conversations are uploaded to Confluence (dedicated space).
  5. Gamification (cross-concern). Asking questions rewarded with seeker points. When you question was unanswered you get revealer points during knowledge gap discovery. Luminary points are given to those who answer knowledge gap questions

@bwilczek bwilczek self-assigned this Apr 5, 2024
@bwilczek bwilczek force-pushed the devx-3878-document-current-implementation-in-gherkin branch 2 times, most recently from b6c1c17 to 45eba5b Compare April 11, 2024 06:46
@bwilczek bwilczek force-pushed the devx-3878-document-current-implementation-in-gherkin branch from 39093d1 to 2db00e3 Compare April 15, 2024 07:41
@bwilczek bwilczek marked this pull request as ready for review April 15, 2024 07:41
Comment on lines +36 to +37
And corresponding documents are not found in the database
And question "What power superman has?" is saved as unanswered

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Documents are always found (at least with the current implementation, though maybe we could set a distance threshold. So it is up to OpenAI to recognize that given documents don't have info about that.

Then listener asks OpenAI for embeddings for the given phrase
And corresponding documents are not found in the database
And question "What power superman has?" is saved as unanswered
And I get rewarded 1 revealer point

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Revealer points are rewarded only when knowlegde gap search is initiated. At least with the current implementation.

| Holiday Policy |
And binary document representations are stored in vector database
| title | embeddings |
| Holiday Policy | <format t.b.d.> |

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just in case - vector db stores page ids, not titles. Though I think we could build a matcher that will transform it automatically.

@bwilczek bwilczek closed this Apr 16, 2024
@bwilczek bwilczek deleted the devx-3878-document-current-implementation-in-gherkin branch April 16, 2024 08:16
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants