You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Below is a mockup of the interface we're aiming for:
importtopicexplorerte=topicexplorer.from_config('ap.ini')
# access the corpus with .corpuste.corpus# access the individual models with dictionary attributesassertisinstance(te[k], LdaCgsViewer)
te[k].thetate[k].phi# comparing two models using the interfaceimporttopicexplorer.analysistopicexplorer.analysis.model_dist(te[20], te[40])
# integrated past_to_text analysisordered_ids= ['some', 'labels', 'by', 'date']
p2t=topicexplorer.analysis.past_to_text(te[20], ordered_ids)
### returns raw numbers# possible plot library?importtopicexplorer.analysis.plottopicexplorer.analysis.plot.past_to_text(p2t)
This is too much for a single ticket, and definitely more of what I'm thinking for a 2.0, but I want to get at least to the point where the models are loaded with topicexplorer.from_config() in notebooks.
The text was updated successfully, but these errors were encountered:
@colinallen Referring to your original comments on what happens when models are incommensurate, the methods I have reduce the vocabulary to the union of the two corpora and only compare topic distance on the remaining terms, but do not re-normalize the distributions. This at least maintains that we have a probabilistic source signal yielding tokens, and then non-assigned portions of the distribution (that is the parts of the vocabulary in the difference) do not contribute to the model distance.
importtopicexplorerte=topicexplorer.from_config('sep.ini')
# use dictionary access to get the tokensassertte.corpus['neo-kantianism'] == [25, 37, 141312, 12, ...]
# assert a document label is in the Corpus objectassert'neo-kantianism'inte.corpus
Originally raised in #150
Below is a mockup of the interface we're aiming for:
Some other thoughts:
This is too much for a single ticket, and definitely more of what I'm thinking for a 2.0, but I want to get at least to the point where the models are loaded with
topicexplorer.from_config()
in notebooks.The text was updated successfully, but these errors were encountered: