This project has two purposes. First, to provide a personal repository of appreciated content: a place to store everything you read, watch, listen to, and care to remember in the future. A sort of second brain. Second, to create a universal, collaborative database of quality consumer media, encoded in a way that supports highly expressive queries from users.
By contributing to coMedia, you get two things: a repository of your media interests, and a powerful media search engine for you to query.
There are three main functionalities: searching for content, adding commentaries, and listing commentaries. These are accessible through a simple command line interface. To start, run:
python main.py
Enter a user ID. This should be a distinct name, to keep your contributions together. Next, enter the action you wish to perform: one of 'add', 'search', and 'review'. This is an example of search:
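The prompt-and-dispatch flow above can be sketched as follows. The function name, signature, and handler shape are illustrative assumptions, not the actual main.py API.

```python
def dispatch(action, user_id, handlers):
    """Run one CLI action ('add', 'search', or 'review') for user_id.

    `handlers` is assumed to map action names to callables taking the
    user ID; the real main.py may structure this differently.
    """
    if action not in handlers:
        raise ValueError(f"unknown action: {action!r}")
    return handlers[action](user_id)
```

In this sketch, an unrecognized action raises an error instead of silently doing nothing, which keeps typos at the prompt visible to the user.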
The search engine considers both the descriptions and the comments provided by users. These are embedded by an LLM and matched against the search query. The most similar content entries are retrieved and shown to the user as results.
To build effective queries, follow the same guidelines given for adding content. That is, include a brief description of what you are looking for, in which language, how you would like it to be, what you want it for, and so on.
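The retrieval step described above can be sketched as a plain cosine-similarity ranking over embedding vectors. The real system obtains the vectors from an LLM; the tiny hand-written vectors here only stand in for those, and the function names are illustrative.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def search(query_vec, corpus, top_k=3):
    """Return the ids of the `top_k` entries most similar to the query.

    `corpus` is assumed to be a list of (entry_id, embedding) pairs,
    where each embedding covers an entry's description and comments.
    """
    ranked = sorted(corpus, key=lambda p: cosine(query_vec, p[1]), reverse=True)
    return [entry_id for entry_id, _ in ranked[:top_k]]
```

For example, with a corpus of three toy vectors, a query embedded near the first entry retrieves that entry first.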
Whenever users want to store their own content, they must provide the following:
- (Optional) A set of bibliographic details for matching: Author, date, location, language.
- A description of the digital content, as objective as possible. Between 3 and 10 sentences.
- +/-: A comment on the content, including, for example, its best and worst parts, or the context in which you think it is best consumed. Never include personal or identifiable information here.
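The fields listed above can be captured in a simple record. This is only a sketch: the field names are assumptions for illustration and may not match the stored XML schema.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Commentary:
    """One user contribution, mirroring the fields listed above."""
    description: str                 # objective description, 3-10 sentences
    comment: str                     # the +/- commentary; no personal data
    author: Optional[str] = None     # bibliographic details, all optional
    date: Optional[str] = None
    location: Optional[str] = None
    language: Optional[str] = None
```

Making the bibliographic details optional mirrors the list above: only the description and the comment are required.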
After adding the commentary, the coMedia system locates the most likely content entries already in the system, trying to match the new comment to existing content. This matching is done through LLM embedding similarity over the available descriptions. Upon reviewing the most likely options, the user confirms one of them or rejects them all. If one is confirmed, the commentary becomes associated with that existing content. If all are rejected, a new content entry is created using the bibliographic information and description provided by the user.
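The confirm-or-create flow described above can be sketched like this. All names are illustrative; in particular, `confirm` stands in for the interactive prompt where the user picks one candidate or rejects them all.

```python
import math

def match_or_create(new_desc_vec, existing, confirm, top_k=3):
    """Rank existing content by embedding similarity to the new
    commentary's description and ask the user to confirm a match.

    `existing` is assumed to be a list of (content_id, embedding) pairs.
    `confirm` receives the ranked candidate ids and returns the chosen
    id, or None to reject them all (meaning the caller should create a
    new content entry from the user's bibliographic data).
    """
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        return dot / (math.sqrt(sum(x * x for x in a)) *
                      math.sqrt(sum(y * y for y in b)))

    ranked = sorted(existing, key=lambda p: cosine(new_desc_vec, p[1]),
                    reverse=True)
    candidates = [cid for cid, _ in ranked[:top_k]]
    return confirm(candidates)
```

Returning None rather than creating the entry inside this function keeps the "create new content" step, which needs the bibliographic details, in the caller's hands.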
The 'review' action simply lists all the comments added by the active user.
As a working prototype, and considering the many changes happening in the persisted data, an external procedure is provided to produce the LLM embeddings for all data. This is done by running embed_main.py, which processes the XML file, generates the embeddings, and stores them in a pickle (.pkl) file.
In other words, you need to run 'python embed_main.py' after adding comments to the system, so that they are embedded by the LLM model and become accessible through the search functionality.
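What embed_main.py does, per the description above, can be sketched as: read the comments from the XML file, embed each text, and store the vectors in a pickle. The file names, the XML tag, and the `embed` callable are all assumptions here, not the actual implementation.

```python
import pickle
import xml.etree.ElementTree as ET

def build_embeddings(xml_path, pkl_path, embed):
    """Embed every comment in the XML file and pickle the result.

    `embed` is any callable mapping a string to a vector, standing in
    for the LLM embedding call. The `comment` tag and `id` attribute
    are assumed names for the stored XML schema.
    """
    root = ET.parse(xml_path).getroot()
    vectors = {}
    for node in root.iter("comment"):
        vectors[node.get("id")] = embed(node.text or "")
    with open(pkl_path, "wb") as f:
        pickle.dump(vectors, f)
    return vectors
```

Because the embeddings are recomputed from scratch on every run, the pickle always reflects the current state of the XML file, which is the point of rerunning it after adding comments.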
This is the prototype of a prototype. Important pieces are missing, such as proper user management, a proper interface, and privacy safeguards (e.g., data anonymization). Don't ever submit personal information, misinformation, or any other sort of illegal content.